Abstract of JP6121689

PURPOSE: To elucidate the entire gene sequence of the non-A, non-B (NANB) hepatitis virus strain HC-J6 and, on that basis, to detect NANB hepatitis virus with high sensitivity.

CONSTITUTION: The invention concerns an NANB hepatitis virus gene and relates to a polynucleotide, a cDNA clone, a polypeptide, a polyclonal antibody and a monoclonal antibody related thereto, and further to an NANB hepatitis virus detection system using them. The antibodies are capable of detecting NANB hepatitis virus with high sensitivity, and the related polynucleotide and cDNA clones can be used to produce the detection system.
[tt#« 1 ] ESIJ## 1 JH«E0ffi«E5l|£^-r-5*A*
[M##2] RNA5' *«*y 3 4 0®fl)5l?U#^ 

^Dissassa- K-rafa*s#«r9 o 9 gsrox^ 

U**h\ ftt.tfl-3' ^Sffi 1 088<DU^ H/9fS 

at i 5 omoxt u** K**fts. 2***35 i 
ana-waft***- a # a # b siff $ «r ;ux gsttae 

[Si*®4] iB5W^2l8*ffl«*ia5i|£**-a#'j:a 

2 h'N-BBBS, 

[t«*«5] i29JS^3EI£a>£SBB$i|£5t-.6cDN 
A£o— 6-4 8 1 „ 

[»*«6] E5l|#^4I3Ka)ffiSE5»J**-r-Sc DN 

A£P— > J 6-08. 

* Ktanawaft^tt-ra^AifBffliff*^^^** 

[If*® 9] E«a^S|B«ffl7 5/»ffi5iIS#-r-5*° 
V*?** h*P-3 0 3 3. 
[gft*%1 O] »*E£i:«ftl> LIS 7 

^'JX^Utf h\ cDNA9P->ff>±Bf*fcl* 

[»*«i i] iS*«*8«fetvL*i o«ia*<n^'j 

[ffi*«i 2] a***safti»Lai o«ia«©*u 
^^-r K^fflnt^A^BasFsatti*,, 

[R«*i 3] Mt«fti i «tB«0>!/if*£ffl(.>fc#A 
#BS£lff&8&Hi*, 
(SUHtDSMfflftSftflfn 
[0001] 

[S*±(Dfi|ffl#5] *A#BSIFie>-OU 
X (KIT rNANBFS'5'f^J tBgfata) «£ 

k\ 46tficiaaftis. tsflEftttssi-iM-ta. 

[0002]

ASHJF&, B£ffft«tft«. Z*l«*ftt-ftLT. #A 
#B£!|ff* {JSIT rNANBST&J £1*7) fcWftlT 
#fc„ B *f- j3 1+ SMUfiiaffFiS I*, BMF3SOTfetUaEA { 

Lfctf, fcfc, *ra«S2 8 75«!(i6<, NANBffF&O-f 
jtxc j: y ftg L, r a t #*. £*vr # fc„

[0003] iS#. NANBlff^^JWIffilitt. 5>-f 

■trow£i&<aiKL. «ar=iS)t+Ta>»73fitifi:»wi=aift 

&ifc**5(=ft-3fc. tAdE«feJFf %r=ga L-C, 1 988* 
7* CSJIT&^-OUX (BIT ChC 

vj iiSIB-TS) ©rnao* jux^v a©?p— >1t 
Ki3E5bLfc£«£U dhS4ti^HCvfii»(niMSSS 
m%Ltz. ffltt. HcvftttiHSi&li, JSifiLiflL&ro;*.? 

■ T!fliL*6ttTL^, C<DHCVSH*©ai£SI*NAN 
BJffjfetoB8iltt3li t »6M--aB-ei±B«>&JxS B Lfr 

i, N A N Bgf iS^©*tffll±tt« t L"t***ih.T l*ft 
H. NANB|ff3S(D«ai*TfiT?*y. *SF»ftl=J:-3 
T&fcJS&ifcfc (*yU7*) U SttffFiSilcjtKt-a 

[0004]

[ftw^iiiaLi^trsaa] ±e*m p>tt©Hc 
vflitt«ai*'y h-e[±»R-e#fti*NANBff*A<#» 

a -£)**>. BfcfcfcltaNANBBfjfe-J-Ol/X 

yy -r p isftoim Lfc h c v vj Atit&r l 

fc**^T. tt«S. flSaiCfe^-CS&tflHlfcNANB 

tW:„ *SSIK3©gWI±. NAN BBf jjfc'jn' IVT^fJ L. 

^Ut7 = yiU^t«MSff^t, ftp>mzc 
tlSffl^-CNANBKjS'S-f/tXMjitSffi. tsftroism 

[0005]

[gS 3 fctf>ro*K] **SB#6lt. NANB 

he *tf^>/i>i?-+¥ >J 7ffliniiij j: y n a n bsf^ 
-5-f*x©RNAt«IL. #a-K«wt#«aesT 

fc. *©Sft*3ei!B#&l±, NANBM^-fWtS 
5 H C - J 6 VJ Affl»£^«:(Dffi4£E5l] *^±rcflS 

50 oHcvyy Atrtaam^^-if&a. 
[0006] EJiJS^ 1 Ee©^$B55i|£*

- K^^^lgE£#t;9 0 9 9ffi©*2 L»sJ-^ K. fcf>tj: 
IZ3' *4SM 0 8»©UXh-y^£^<t;i 5 0ft<03l-> 

fswT'&y. c*Lbi. a»««»s*-r-6*A*Ba 

^2ffi«©£*E?J££r&#y*?b#^KN-9 5 10 

8 9, E5IJ#*§-3K«a>S*ffiSiJ£*rr-5cDNA$D 
-^J6-*81. e9)»^4jBttO%£S$>l£«-r% 
c DNA^n— >J 6-0 8, Z*i£,&*>7 U^-^Kt 

UlbaiTS. J WBMttt&KV KP-303 
S.ffiffiigfcT. ^'JK^U^h', cONA5Q-> 

p— 7-rtHfi#*fc1i^ / * P-t-^tii<*£n*WT7fcS. 
S »EftjKy<^*-f K£ffl(.tfe#A*BBIFft 

«HB4tt*tfflL*fc#A*BsiF*auii*(«)» 
[0007] ^^mNANBiff^^-rji/xae^-ita 

©*3I=LT«T. *©£&E5t|£*fe;tLfc 0 t hfcjc 
tff >/<W~jHlSlJ:y#fcft»:. HC-J1, HC- 
J4, HC- J 6£fflt,*-t#9f3E£iii#>fc„ HC-J 
1. HC- J 6l*B#A#lflt%4Jfe-C?, t,vf*lt.HCV 

a»Bttt**snfc*»* y#fc. hc-j4«na 30 

N BUFiS«)!SJft1t$ttffiLt=? W^i;-frC»fflfc«{* 
ir**? - &A*. L^LZ*lli7J-f P^tt©#SB©HCV 

ts#T?[* (-) T?&^fc„ c<D3a©i{ii«e«f*iyRN 

A^iBfiL, C*ib(-J:^TRNAO)SSE5l|$S3'< 
fc„ Z©5*>RNAffl5' 3Mli04l<l2 5OO, 
3' Sflt©#|1 1 0 0©ig£S£]lco[,>TI*KEI-#*f4 
S26 BttttWffilg (t#fig*F3 - ) 91 3 76f) fc& 

ui^6 a 2 o Bttft»tb»i=fc^T-5-<D-aii£isaiK l 

fc-aur, «fel=MKtadV«3-K^«mcDaD 40 
NA^P-Vffl«i£tfl?BBL, CftlZjc-aT. HC-J 
Bf/AfflSifiRNAg 5 S 9i£g©E?!l©Bf3££^ 

[0 0 0 8] H C - J 6 fJ Al±. SUSWc^-tJ:? 
IZ. 5' *J»«fcy3 4 0ffl©3l£U|-F.fcyft4 5' # 

«*, 3' *«fflf±, #«M&KSa-Kf 
offiS<0*=i-h'iSJS**f -5. 5' *«ffilfre>3 4 1 50 
Si»rf->j:yftS53- K«art9o 9 gigs©

-*fflfil*-^> l J-foy7l/-AJ:!ilSy, Z 
HA<3 0 3 37S^8g3- KUr^-&CtA*B&A^t 

tt-afc. hcv, t-**?*.csiff**>-f^xi±^©ae 
Tfltitfr t> 7 =3 tf o -r i-jajs-tf & 5 t #X 6 ftT I* 
-S* 4 , *K«©NANBfFjli'7^;uxaffi*ffl3-Hffl 

^) . ns-1 (#«itge~ i ) , ns-2 m- 

2) . NS-3 im-3) . NS-4 (H-4) „ NS 
-5 (0-5) A*&fc5i#x.e,;h5. 
[0009] *JHBfl)HC-J61rVAIl. 3 — 
#MHI«3 8 8. 2 3 2# l=***tfc 
HCvyyitttKtSt. a^b^FEJIIKOIOT 
±ffiWt?l±6 7. 9%<DffiStt^ > ifcTS/BEMf;: 
-3Utli4ffi«TJ7 2. 3W««Htt$*tl=t#S 

-K«WH=JSt»T*«E»|-e0 4. 4%, 37«WI=fe 
LvT7SySE?i|-l?9 0. 1<>6T?*y. cHta©H«-t? 
l±ibte«St»S|5|tt**L-CL^. i^B-7 
»»J:yT3![©ia«-(?l±e. NS-1. NS-2. NS 
-3, NS-4, NS-5©-t^f*v©7SyB9E«-e 
60. 4%, 71. 1%. 57. 8%. 81. 196, 7 
3. 1%. 6 9. 9%. 3' *«!#□- KS«<n£*ffi 

5II-C2 9. ewilv^tffil^WtttS-tfct-tfr. 

£5, HC-J 6«A<A^ □>&©»£ LfcHCVSt 

[0010] *»W#e.HI-fi*Lfc (Japan. J. Exp. Med., Vol. 60, 3, 167-177) HC-J 1 (7J< 'J AS) , HC-J 4 (B*
ID &t<DT*SMa#l<Dtt,®X'\t, ztT%MVtt.9 0 
%jy±©48P3ilfe£*-*-©l->ttLT. x>Kn-^fgJ«-e 
tt-eH-p+L 6 0 . 9%. 5 3. 1 WtBlvHHtttS-t 
l=*#fcA*-afc. **MI=J:y. HC-J6tttt, hc 

hc- j 4tt©L^r*Lt : t.m'5:*siiffiffl*-f 

^©■>-f JtXT? fcS C £ A** £ Ai t U o fee 
[0011]

[ftffl] *fiffl©ae^. ?Kyx*b^F, cdna 

?p->li. ^©^3BSfc[t~ai£A»B, tt¥S*© 

©sajst Lrfuniz $&r=, zftizw^- 

«/9P-^l«*lt. *A*BeBFKfltU«^LriB 
[0012]

[SJiffl] WT. *«W©S16«l=o^ra'<SA { . t 
4J:y**fflA<c*ie ) fflSJSfflrc!Bies^-6t©-e!i^ 

LV 

[0 0 13] HtSffli 
(1) RNAWtti HC-J1, HC-J405' 5fciSSfll£3Ba?!l0>ft£ 

B*A*4l*©l4lSRA^^fc, HCVttttHMi V o DN A5-f r}=5 >)— £ffll>fcH C - J 1 fJ A©5 ' 

- ^c-fryy -J 1 ? - v^TiiXttsi, *-vhc S*i&lroi&gEfflS«ri*S2 (hc-ji) H3 (hc 

VAbELISA^vH t*DS*tlfc«i«:HC-J -J 4) KSt-f J; ? IC, c DN A£/^f 'Jt77— ✓ 

1. HC-J efc^tfNANBIffiBiOftSSaBLfc 1 ? 1 A g t 1 Ot»ALT»fcf n->«0iSfI«fo DNA 

>/Ov~4»6»fcHCvRft {-) ©tt«:HC-j4 SPCRicr«(gLr^e.^ifc5P-^m(B#fjg*mfi5 

<ty, awi^^LTRNASttthLfc. imkicmj*. «s»tt»SLfc. H2asj:tfe]3 0:NANBffHK^ 

(1 OmM. pH8. O) £fl3x., 68x1 J|,XfrV AO) 5 ' *aS*fHRB*Wffi«Stti t tlZS 

0 3 r pmT*1B#|HS<£>L*:. $ S *Lfc ^ L, -y H=2 0 U ffiffi^&^-l' V-OTteK*,*^, @ + , 
OmM NaCI, 10mM EDTA, 2%{w/ 10 $ t 'J *7t-5? A g t 1 OO^-f ?7 'J -l=J:59 □ 

v) h'T'S'MUH- h 'J (SDS) . feeing ->-efflHe***«Lfc«B*. Attl*PCRI=j:« 

/m I a>7PT^--4?K££t;HJJ*gttttgr;ft (5 0 $P— >T?£*KM£»£Lf::i8HSsK-*. HC-J 1 

mM. pH8. 0) Sftl*.. 6 0°C-eiB#|H]fll]£U X ffln t 4 54-2 1 O 90)1 6 5 6iSSIi^7Y7-# 

MMKSfTlV RNAjjJfc. 8**&»fcc DNAf A g t 10 77-^9*- (7 

[0014] (2) HC-J 1 cQNA^sE ?*>+Att) l-|*ALT$S>:h.fc9 4 1 Icj: y 

HC-J 1JtaRj:yttBLfcRNA£7 0t-eifl*UO JfcSLfc. 0*J=C(D«>— I> x>*£ |-fc|£ Lfc, 

SU LT 1 Oi-'> t-a&K^BS (c n t 8 2 4-8 4 3 <OSr Ll^-f ?-#2 5 (5' - 

DNA Synthesis SystemPlus, T C C C T G T T G C A T A Q T T C A C G- 3 ' ) £ 

77->YAft) *5J:tf*-'J^jl? L-tfK^-f^- fflUT, HC"J405oDNA ; 5'<^7 , J-j!r>e.JR*4 
(20 -me r) 20pmo I ^HDxT, 4Z°C, 1 5 20 "D(D c DNA?R->^60, 06 1. #66, 0 7 5 

l*IB£j£Sti:-CcDNA£fflfc. ^-fT-iLTtt. tH-C, ^-fe^T. •>-■ >x>X*&±gE 

3-P-V/<^»tBIHiaBB3 1 8. 2 1 6-%l-jS£*lfcH On t 1 8~8 4 3(Dig|tS?IJ$*ififc 0 

CVgaSOT$4WHLTSffiLfc#8 (5' -GATG [00 17J (5) HC-J 6, 5' SRSSSUlSSffiWD 
CTTGCGGAACGCAATCA— 3' ) Sffll* 

fc» hc- j ey/Acos' *ss«y©S£E?iii*E4its^ 

[0015] (3) jK'J*^— IfTx-f >U7?V3 ^ J; 5 lc. o D N A £P C RlCTljJig LT»e,*Lfc£ P- 

(PCR) KJ^cDNAfflfcifg >©flttr«W=J:y»*Lfc. HC-J6Bf*|Z^ 

Gene Amp D N AJtiBStJII*-? h (/i-*>x T. ttft Lfc R N Afr£flfr£ (2) OT^SEKtSo-C, o 

„, v _. ->-$xtt) SfflUfcSa i k i t,0JSi± (S DNA£«]ffiU x>7,*ftS Lfc. nt 2 

cience 23 9, 487-491, 198 8} 30 4- 2 5 5 1 l±H C- J 4 (D^KH^lZ Lfc^-f 
«fcy, DNAf-7-f?7- (/^-^-i/Jijt,?- - 7-iD)B*^tl:J:5PCRi:r»btifc^Q~> 
?Xit) 1C-C3 5W9(M0c DN AiSMISfi^fc. y-^I^frb^tiW;, 
[0016] (4) cDNA^^5U-OT8IJ|gl=<fc£ 
nt 24~826

#32 (5'-ACTCCACCATAGATCACTCC-3')
#122 (5'-AGGTTCCCTGTTGCATAATT-3')

Clones: C9397, C9388, C9764
nt 732~1907

#50 (5'-GCCGACCTCATGGGGTACAT-3')
#128 (5'-TCGGTCGTGCCCACTACCAC-3')

Clones: C9316, C9752, C9753
nt 1867~2591

#149 (5'-TCTGTGTGTGGCCCAGTGTA-3')
#146 (5'-AGTAGCATCATCCACAAGCA-3')
Clones: C11621, C11624, C11655
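
Each amplified region in the list above is given with a pair of primers written 5' to 3'. The sketch below is illustrative only and is not part of the patent. It assumes that the first primer of each pair is the sense primer and the second the antisense primer, and shows how an antisense primer can be rewritten as the sense-strand sequence it anneals opposite to. The helper name `reverse_complement` and the `PRIMERS` table are hypothetical.

```python
# Illustrative helper, not part of the patent text.
# Assumption: in each pair, the second primer is the antisense primer.

# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences copied from the list above (5' to 3').
PRIMERS = {
    "#32": "ACTCCACCATAGATCACTCC",   # nt 24~826, assumed sense
    "#122": "AGGTTCCCTGTTGCATAATT",  # nt 24~826, assumed antisense
}

# Sense-strand sequence corresponding to the antisense primer #122.
site_122 = reverse_complement(PRIMERS["#122"])
```

Searching a cDNA listing for `site_122` (allowing for strain-to-strain mismatches) is one way to locate the downstream end of the amplified fragment.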
£5>II±iffE<D5' *aS«y^^^-f Sfctt. nt246~ d A#Altm*. 2Sfg(0o n e-s i d e d PCRli 

265OT7>^-tr>X^7-rT~#36 (5' — AAC (SSftofc. ti'^t, 1 ShWJ =f d T V- 

ACTACTCGGCTAGCAGT — 3' ) tfflL^T (2 0-me r) t n t 1 8 8 ~ 2 0 7 ©T^-fe^A 

cDNAS^Lfcffit, * — 5-^-JU7 i ^-*->jl'?^ ^7-f7-#4 8 (5' -G T T G A T C C A A G A A 

fiJJbh7VX7x5- fflCfcy o DN AC05' 50 AGGACCC-3' ) 3rffll*"t 3 Si^-f •? JKD P C R 
JdT^<7-(20-mor) t n t 1 4 0 ~ 1 6
0<07'>-J L -b>X"? : 7-i , -7— # 109 (2 1 -ma r : 
5' — ACCGGATCCGCAGACCACTAT— 
3' ) £J8t,\T3 0-9--f ^^mPCR^lTcifc. f§e>*l 

1 3S©iffi<l:Lfc£ □ — C957 7, C9579. 
C958 1, C9 5 8 4, C9 587, C9590, C 
9591. C959S. C9606, C 9 6 0 9 , C9 10 
615, C9616. C9619 ttl^Oi^ >"ir 

^li-X Jl>7*6n t 1 ~2 3©lggEJ!|£3!; 

(6) HC-J 6, +*H«eSBE5Ua)*S 

hc- j ejfiijgi o Om I (2) ©tf&lzif 

CTA g t 1 0/^f l Jt77-vtcDNA7'<7'7 
y-£WS&Lfc. ??<7-iLt, 3-P'vA»ftUl 
SB93 1 8, 2 1 6-51:15: $:fifeHCV££S9!l£#ltB 
L7MtLfe#'J3(9L<:*7' h*#i 6 2fc<fctf#8 i £ 

vi=*yffofc. ■ir=s+j:3i;. &t>*Lfc4fB©c 

DN A£ □ — >0 2 (nt 6996-8700) , 06 
(nt 6485~8700) , 08 (nt 6008-8 

7 005,081 (n t 2 1 9 9-6 1 6 8) 

52~87oo©Ms?iS3sjLfc. ?R 
->08 1. 0 8r±-t*i-P4tK5IJ#^3. 4©£gE5iJ 

[0 0 18] (7) HC- J 6?/J*0)3- £Sj|Bfl!©J£ 

fc*p- yjSfejgLfc. HC-J6(Dnt 

8 7 0 1 -924 1 a)iB5i|i=ourrt«t*s^5Y-7- 

#80 (5' — GACACCCGCTGTTTTGAC 
TC-3' ) teJ:tf#60 (5' — GTTCTTACT 
GCCCAGTTGAA-3' ) £ffll>fcP C RK^ 
If, S6*lfc9 3 3ffiS«)^ O-^C 9 7 6 O. C92 
34, C976 1«S£e5rj < fcy»&jh.-5 3>te>frX 
?XVX*baSLfc. n t 9 24 2J:yTj6ffl> 
3' *!ISfflSgEMftJ5lT©75&fflffioT*£Lfc. * 
Sfrfc. «-ft*J:y CD IB*<D;*&fcffioTRNA£ 40 
tttliU JK'J (A) Jif'Jjt^— tf£fflUTRNA<D3' 

swttfy (a) sftjpL. =j-y=r (dT) zo 

t LTc DN A*J*tff(.\ »fc>tlfccDNA£ 

f>7"l/-hiltPCR)cftUfc. pcr|±3Si KB 
iLT-b^7'7-f'7-tLT#9 7 (5 -AGTC 
AGGGCGTCCCTCATCT-3' ) £, 

•ofc, *t. S^tlfcc DNA£&2|&PgPCR£LT 

s 1 KSJ: y Tssctta-ra* >x -*■-# 9 o 
(5' — GCCGTTTGCGGCCGATATCT—
3' ) 7 , V^-bVX^ : 7'f7-iLT:7|-'J=r (d 
T) zo fcfflUTfTofc. 2SB©PCRT?Se>*lfc)fi 
BS«JI= T 4 D N Affs y 7— tf£ttffl * 
Lfeffe. T4iKyx*U-*T-K++— BUT 5' *4BSS 

y>BftL, Ml 3 m p 1 9 77—^^9-roH i n 

fc„ -5LTtbtlfc1 9i(D7B- >, CI 03 1 
1. C10313, C10314. C1 0320. CI 
0322, C10323, CI 0326, C1032 
8, C10330, C10333. C10334, C1 
0336, C10337, C1034S, C1034 
6. C10347, C10349. C10350, C1 
03 5 7^6>?#£>;h.5P>-fe>+l-X >~^i>x«ty 
3' *4SSiJ©£aE5i|£fc5£Lfc„ UJblOIBIIfBSJ: y 
H C- J 6 AfCfeSr S c O N A J63iBB5<J£BM# 
52iB«©<fc9f=»£U SfclrVimStfiRNAS 
*BM * E5>J#5 1 IE«© J: 5 l=ft« L fc . 
[0 0 19] (8) HC-J 6, 7 S / KK51|©*£ 
HC-J8VyAORNA«BEIttttK, SJ&zi K 
l/ATG*y *&*■&=! - KAMar S y MBffl £E9j» 
#5©J:?IZife£Lfc. *©MIL HC-J6l±3 03 
3 7 5 J MB** 'J -f KSffRft* □ - K 

[0020]

[««©&*] ^A^BSffFjfe^-CJUXHC 

+Mfi«:*tl«-*- 
[Hffi©MM&ttSft8] 

IBM] I*. :M&W©NANBK£*<ni,Xft&7-©p 
SEQ ID NO: 1
Sequence length: 9589

Molecule type: genomic RNA

Sequence

ACCCGCCCCB AABAfiGGGCG ACAC0CC6CC ABGAACCACU CCCCU6U6AG 6AACBACUGU 60 

CBUCACGCAG AAAGCGUCBA GCCAUGGCGU OAGUAliGAGU GUCGBACA6C CUCCAGGCCC 120 

CCCCCUCCCG GGAGAGCCAU AGBGGUCBGC GGAACCG6UG AGUACACCGG AABWGCCGSG 180 

AAGACUGGGU CCUBIICUBGG ABAAACCCAC UCUAUGCCCG GBCAIfBUGGG CGBGCCCCCG 240 

CAAGACIIGCO AGCCGAGOAG CGBBGGGUB6 CGAAAGGCCB UGUGGliACOG CCIfGAUAGGG 300 

BBCOUGCGAG UGCCCCGGGA GGIKUCGUAG ACCGUGCACC AUGAGCACAA AJICC1MAACC 360 

(ICAAAGAAAA ACCAAAAGAA ACACCAACCG UC6CCCACAA GACGUBAAGtf IHJCCGGGCGG 420 

CGGCCAGAOC GUUGGCGGAG 0ABACBB6UU GCCGCGCAGG GGCCCCAGGU IJGGGBGUGCG 460 

CGCGACAAGG AAGACUUCGG AGCGGUCCCA GCCACGOGGA AGGCGCCAGC CCABCCCIM 540 

GGABCGGCGC UCCACBGGCA AAUCCBG6GG AAAACCAGGA BACCCCJJGGC CCCtfAOACGG 600 

GAAUGAGGGA CUCGGCIWGG CAGGAUGGCU CCUGUCCCCC CGAGGUUCCC GUCCCUCBOG 660 

GGGCCCCAAU GACCCCCGGC AOAGGUCCCG CAACGBGGGO AAGGBCAUC6 AUACCCOAAC 720 

GUGCGGCBBU GCCGACCUCA UGGGGOACAU CCCUGUCGBA GGCGCCCCGC JJCGGCG6CGU 760 

CGCCAGAGCU CUCGCGCABG GCGBGA6AGU CCBGGAGGAC GGGGUUAAUU UUGCAACAGG 640 

GAACUUACCC GGUBGCUCCB BBBCUABCJU CBUGCUGGCC CUGCUGUCCU GCABCACCAC 900 

CCC66BCUCC GCWCCGAAG BGAAGAACAU CAGUACCGGC UACAIIGGUGA CCAACGACBG %0 
CACCAAUGAU AGCAUUACCB GGCAACUCCA GGCUGCJGUC CUCCAC6BCC CCGGGUGCGfJ 1020 
CCCGHGCGAG AAAGlfGGGGA AUACAUCUCG GBGCBGGAUA CCGGBCUCAC CGAAUGUGGC 1060 
CGUGCAGCAG CCCGGCGCCC OCACGCAGGG CilUACGGACG CACABUGACA UGGBUGUGAB 1 140 
GUCCGCCACG CUCUGCUCCG CUCUUHACGU GGGGGACCUC UGCGGUGGGG UGAUGCUUGC 1200
AGCCCAGAUG UUCAUUGUCU CGCCACAGCA CCACOGGiJUU GUGCAAGACU GCAAUUGCUC 1260 
CAUCUACCCU GGDACCAUCA CUGGACACCG CAUGGCGUGG GACAUGAUGA UGAACUGGUC 1320 
GCCCACGGCU ACCAUGAUCC UGGCGUACGC GAtfGCGCGUC CCCGAG6UCA UCAtlAGACAU 1380 
CAUUGGCGGG GCUCAUUGGG GCGHCAUGUU CGGCUUAGCC UACUtlCUCUA UGCAGG6AGC 1440 
GUGGGCAAAA GUCGUUGUCA IWCUUUUGCU GGCCGCCGGG GUGGACGCGC AAACCCAIfAC 1500 
CGUUGGGGGU OACCGCGC AUAACGCCAG GACCCUCACC GGCAtfGtWCU CCCHUGGUGC 1560 
CAGGCAGAAA AtfCCAGCUCA (ICAACACCAA UGGCAGUUGG CACAIICAACC GCACCGCCCU 1620 
GAACtfGCAAU GACUCUUUGC ACACCGGCUU CCUCGCGUCA CIJGfJUCl/ACA CCCACAGCUU 1680 
CAACtlCGUCA GGAUGUCCCG AACGCAUGUC CGCCUGCCGC AGUAUCGAGG CCUUUCGGGU 1740 
GGGAUGGGGC 6CCUUACAAU AUGAGGACAA UGUCACCAAU CCAGAGGAUA ilGAGACCGUA 1800 
UUGCUGGCAC UACCCACCAA GACAGUGtIGG UGilAGUCUCC GCGAGCIICIIG tIGUGUGGCCC 1860 
AGUGUACUGU UUCACCCCCA GCCCAGUAGU AGUGGGUACG ACCGAUAGAC IIUGGAGCGCC 1920 
CACUUACACG UGGGGGGAGA AUGAGACAGA UGflCUUCCUA UEJGAACAGCA CUCGACCACC 1980 
GCAGGGGUCA KGGUUCGGCU GCACGUGGAU GAACUCCACU GGCUACACCA AGACMGCGG 2040 
CGCACCACCC UGCCGCAUUA GAGCUGACtfU CAABGCCAGC AUGGACUUGU UGUGCCCCAC 2100 
GGACUGMUU A6GAAGCAUC CUGAUACCAC CUACAUCAAA UGUGGCUCUG GGCCCHGGCU 2160 
CACGCCAAGG UGCCOGAUCG ACHACCCCUA CAGGCUCMGG CAifUACCCCU GCACAGUlfAA 2220 
CUAUACCAUC UUCAAAAUAA GGAUGUAUGU GGGGGGGGUC GAGCACAGGC UCACGGCUGC 2280 
GUfiCAAUUUC AClfCGUSGGG AlfCGUUGCAA CUUGGAGGAC AGAGACAGAA GUCAACUGUC 2340 
UCCUUUGCJG CACUCCACCA CGGAGUGGGC CAUUJKACCU UGCACUUACU CGGACCOGCC 2400 
CGCCUUGUCG ACWGGBCUUC BCCACCUCCA CCAAAACAUC GUGGACGUGC AAUUCAUGUA 2460 
UGGCCUAUCA CCUGClfCUCA CAAAAUACAU CGUCCGAUGG GAGUGGGUAG UACUCUUAUU 2520 
CCtlGCUCUUA GCGGACGCCA GGGUUUGCGC CUGCtfUAl/GG AlfGQUCAUCU (IGUUGGGCCA 2580 
GGCCGAAGCA GCACUAGAGA AGUUGGUCGSJ CUUGCACGCU GCGAGCGCAG CUAGCHGCAA 2640 
UGGCUUCCUA UACUUUGUCA UCEJUUlfUCGU GGCUGCUUGG UACAl/CAAGG GUCGGGUAGU 2700 
CCCCUUGGCU ACIWAUtJCCC UCACUGGCCV AUGGUQCUUU GGCCUACUGC UCCUAGCAUU 2760 
GCCCCAACAG GCIWAU6CUU AUGACGCAUC UGUACAUGGU CAGAUAGGAG CAGCUCM 2820 
GGHACUGAUC ACUCUCtllfUA CACUCACCCC CGGGUAIMG ACCCUIOCA GCCGGUUUCU 2860 
GUGGffGGJUS UGCUAUCUUC UGACCCIiGfiC GGAAGCUAUG GUCCAGGA6U G6GCACCACC 2940
UAUGCAGGUG CGCGGUGGCC GUGAUflGGAU CAtfABGGGCC GUCGCCAIMU UCUGCCCGQG 3000
UGU66VGIIUU GACAUAACCA AGUGGCUCUiJ GGCGGUGCIIU GGSCCUGCUU AUCUCCUAM 3060
AGGUGCUUBG ACGCGUGUGC CGUACJUCflU CAGGGCUCAC GCUCUACUAA GGAUGUGCAC 3120
CAUGGUAAGG CAUCUCGCGG 6GGGUAGGUA CGUCCAGAUG GUGCUACUAG CCCUtlGfiCAG 3180
GUGGACtlGGC ACUUAGAUCU AUGACCACCU CACCCCUAUG UCGGAUUGGG CUGCUAAHGG 3240
CCUGCGGGAC UUGGCGGUCG CCGUGGA6CC UAUCAUCUUC AGUCCGAOGG AGAAAAAAGU 3300
CAUCGUCUGG GGAGCGGAGA CAGCUGCUUG CGGGGAUAUC UUACAC6GAC UUCCCGIfGUC 3360
CGCCCGACHu GGCCGGGAGG (ICCUCCUUGG CCCAGCUGAU GGCUAUACCU CCAAG80GUG 3420
GAGUCUUCUC GCCCCCAUCA CUSCUUAUGC CCAGCAGACA CGCGGCCuJl! UGGGCACCAU 3480
AGUGGUGAGC AUGACGGGGC GCGACAA6AC AGAACAGGCC GGGGAGAWC AGGUCCKGUC 3540
CACGGHCACU CAGBCCUUCC UCGGAACAAC CAUCUCGGGG GUCUUAUGGA CUGUCUACCA 3600
UGGAGCUGGC AACAAGACUC HAGCCGGCUC ACGGfiGUCCG GUCACACASA UGUACUCCAG 3660
SJGCUGAGGGG GACUUAGUGG GGIKS6CCCAG CCCCCCCGGG ACCAAAUCUU UGGAGCCGUG 3720
CACGUGUGGA GCGGUCGACC IfAUACCUGSU CACGCGAAAC GCUGAUGUCA UCCCGGCUCG 3780
AAGACGCGGG GACAAGCGAG GAGCGCUACU CUCCCCGAGA CCUCUUUCCA CCUUGAAGGG 3840
GUCCIICGGGG GGCCCGGUGC UCWGCCCCAG AGGCCACGCU GUCGGGGUCU IfCCGGGCAGC 3900
CGUGtiaCUCC CG6GGCGUGG CCAAGUCCAU AGAUOUUAUC CCCGUUGAGA CACUUGACAU 3960
CGMCACKC6G UCCCCCACCU 1IUAGUGACAA CAGCACACCA CCUGCUGUGC CCCAAACUUft 4020
UCAGGUCGGG IMCUUACAUG CCCCGACUGG UAGDGGAAAG AGCACCAAAG fJCCCUGUCGC 4080
GUAVGCCGCU CAGGGGUACA AAGUGCUASU GC88AAUCCC tfCGGOGGCUfl CCACCCU6GG 4140
iMmiiiv'PGPf utlUIRiuualfU IWIl.UUlliR.lrft ftuuLAl#R UUu CAttCAAUCCC AACAtfUAGGA CUGGGGUCAG 4200
GACUGUGACG ACCGGGGCGC CCAl/CACGKA CUCCACAUAO 6GCAAAUUCC UCGCCGAUGG 4260
GGGCUGCGCA GGCGGCGCCU AUGACAtfCAU CAflAuGCGAU GAAUGCCAUG CCGU6GACUC 4320
UACCACCAUU CUCGGCAUCG GAACAGUCCU CGAUCAAGCA GAGACAGCCfl GGGtfCAGGCU 4380
AACUGUACUG 6CUACG6CUA CGCCCCCCGG GUCAGUGACA ACCCCCCACC CCAACAUAGA 4440
GGA6GUGGCC CUCGGGCAG6 AGGGUGAGAU CCCCUUCUAU GGGAGGGCGA UUCCCCUGUC 4500
AUACAUCAAG GGAGGAAGAC ACUUGAUCUU CUGCCACUCA AAGAAAAAGU GUGACGAGCU 4560
CGCGGCGGCC CUUCGGGGUA UGGGCUUGAA CGCAGJJGGCA UACUACAGAG GGCUGGACGU 4620
CEJCCGUAAUA CCAACUCUGG GAGACGUAGU GGUCGUCGCC ACCGACGCCC UCAUGACGGG 4680
GUUUACIKSGA GACUUUGACU CCGUGAUCGA CUGCAACGUA GCGGUCACUC AAGUOGUAGA 4740 
CUUCAGCUUG GACCCCACAU UCACCAUAAC CACACAGACt) GJCCCUCAAG ACGCtlGUCfJC 4800 
ACGlfAGCCAG CGCCGGGGCC GCACGGGCAG GGGAAfiACUG GGUAUUUADA GGUAUGIMUC 4860 
CACUGGUGAG CGAGCCUCAG GAAUGHtltfGA CAGUGUAGUG CUCUGCGAGU GCUACGAUGC 4920 
AGGGGCCGCA IfGGUAUGAGC UCACACCAGC GGAGACCACC GE/CAGGCUCA GAGCAUAUUU 4980 
CAACACACCU GGOUUGCCUG UGUGCCAAGA CCAUCUUGAG UUUUGGGAGG CAGUUUUCAC 5040 
CGGCCtlCACA CACAUAGAUG CCCACUtlCCU UUCCCAAACA AAGCAAUCGG GGGAAAAJUU 5100 
CGCAUACUUA ACAGCCUACC AG6CUACAGU GU6CGCUAGG GCCAAAGCCC CCCCCCCGUC 5160 
CUGGGACGUC AUGUGGAAGU GUUUGACUCG ACUCAAGCCC ACACUCGUG6 GCCCCACACC 5220 
IJCUCCU6UAC CGCIMGGGCU CUGtiUACCAA CGAGGUCACC CWCACGCAIfC CUGtlGACGAA 5280 
AUACAUCGCC ACCUGCAUGC AAGCCGACCtl UGAGGUCAUG ACCAGCACGO GGGUCUOAGC 5340 
UGGGGGGGUC [fUGGCGGCCG UCGCCGCGUA CUGCCUGGCG ACCGGGUGUG UUUGCAHCAU 5400 
CGGCCGCUUG CACGUUAACC AGCGAGCCGO CGUJGCACCG GACAAGGAGG UCCUCUAUGA 5460 
GGCIfUUUGAU GAGAUGGAGG AAUGUGCCOC UAGAGCGGCU COCAUUGAAG AGGGGCA6CG 5520 
GAUAGCCGAG AUGCUGAAGU CCAAGAUCCA AGGCUUAUUG CAGCAAGCBU CCAAACAAGC 5580 
UCAAGACftM CAACCCGCUG UGCAGGCUUC UUGGCCGAAG GJJAGAGCAAM HCUGGGCCAA 5640 
ACACAUGUGG AACUUCAUCA GCGGCAMCA AUACCIfCGCA GGACUAUCAA CACUGCCAGG 5700 
GAACCCUGCU GUAGCUUCCA UGAU6GCAWU CAGIFGCCGCC CUCACCAGUC CGUlfGOCAAC 5760 
UAGCACCACU AUCCUUCUCA ACAUUPGGG GGGCOGGCIW GC&UCCCAA& IWGCGCCUCC 5824 
CGCGGGGGCU ACCGGCUUCG UCGUCAGlfGG CCUGGUGGGG GCJGCCGUAG GCAGCAWAGG 5660 
CUtHWGlWAG GUGClKSGUfiG ACAUCClfGGC AGGGBAUGGU GCGGGCAMtJ CGGGCGCUCU 5940 
CGUCGCAUUC AAGAUCAUGO CUGGCGAGAA GCCCUCCAUG GA6GAUG0UG UCAACCI/6CU 6000 
GCCtfGGAAUU CIIGUCUCCGG GWSCCCU6GU GGUGGGAGUC AUCUGCGCGG CCAUCCtfGCG 6060 
CCGACACGUG GGACCGGGGG AAGGCGCUGU CCAAUG6AUG AAUAGGCUCA UUGCCUIHIGC 6120 
UUCCAGAGGA AACCACGUCG CCCCCACCCA CUACGUGACG CAGUCGGAUG CGtlCGCAGCG 6180 
IfGKGACCCAA CUACUUGGCU CCCUUACCAU AACCAGCCUG CDCAGGAGAC UCCACAA0U3 6240 
GAUUACUGAA GACUGCCCCA UCCCAUGCAG CGGCBCGUGG CUCCGCGAUG UGUGGGAUUG 6300 
GGOUtlGCACC AI1CC If AACAG ACUUUAAAAA CUGGCUGACC UCCAAAUUGO UCCCAAAGAU 6360 
GCCUfiGUCUC CCCUUUAUCU COUGUCAAAA GGGGUftCAAG GGCGUGUGGG CUGGCACHGG 6420
UAlTCAUfiACC ACACGGUGUC CUUCCGGCGC CAAUA1ICUCH GGCAAIFGUCC GCCUfiGGCUC 6480 
CAUGAGAAUfJ ACGGGGCCCA AAACCE1GCAU GAAIMCUGG CAGGGGACCU IHJCCCAUCAA 6540 
liliGWACACG GAGGGCCAGU GCGUGCCGAA ACCC6CACCA AAClKiUAAGA UCGCCAUCUG 6600 
GAGGGUGGCG GCCUCAGAGU ACGCGGAGGU GACGCAGCAC 6GGUCAUACC ACUACAlfAAC 6660 
AGGACIWACC ACUGAUAACU UGAAAGUtfCC UUGCCAACUA CCUUCUCCAG AfifHJCUUl/UC 6720 
CUGGGUGGAC GGAGUGCAGA UCCAUAGGUU UGCCCCCAUA CCGAAGCC6U UUUVttCGGGA 6780 
UGAfiGlfCUCG UUCUGCGUUG GGCUUAA1WC AUUIfGUCGUC GGGUCUCAGC UCCCUUGCGA 8840 
UCCU6AACCU GACACAGACG UAUUGACGtIC CAUGCJAACA GACCCAUCCC AUABCACGGC 6901} 
GGAGACUGCA GCGCGGCGUU UGGCACGGGG GUCACCCCCG UCCGAGGCAA GCUCCUCAGC 6960 
GAGCCAGCOA UCGGCACCAU CGCUGCGAGC CACCIKCACC ACCCACGGCA AGGCCUAUGA 7020 
UGUGGACAUG GUGGAUGCCA ACCUGUUCAU GGGGGGCGM GUGACCCGGA UAGAGUCU6A 70*0 
GUCCAAAGUG GUCGUUCUGG ACUCUCUCGA CCCAAUGGUC GAAGAAAGGA GCGACCUUGA 7140 
GCCUUCGAUA CCAUCGGAAU AUAUGCIfCCC CAAGAAGAGA UWCCCACCAG CCHUACCGGC 7200 
UUGGGCACGC CCUGAUUACA ACCCACCGCiJ UGl/GGAAUCG UGGAAGAGGC CAGAUUACCA 7260 
ACCGGCCACU GUUGCGGGCU GCGCUCUCCC CCCCCCUAAG AAAACCCCGA CGCCtfCCCCC 7320 
AAGGAGACGC CGGACAGUGG GUCUGAGU6A GAGCUCCAUA GCAGAlfGCCC UACAACAGCU 7380 
GGCCAUCAAG UCCUIWGGCC AGCCCCCCCC AAGC6GC6AU UCAG6CCUUU CCACGGGGGC 7440 
GGACGCAGCC GAUUCCGGCA GUCGGACGCC CCCCGAUGAG UUGGCCCUUU C6GAGACAGG 7500 
UtfCCAifClfCC UCCAUGCCCC CUCUCGAGGG GGAGCCUGGA GAUCCAGACU (JGGAGCCUGA 7560 
GCAGGUAGAG CMCAACCWC CCCCCCAGSG GGGGfilJQGUA ACCCCCGGCU CAOGCBCGGG 7620 
GUCOtlGfiUCU ACUUGCiiCCG AGGAGGACSA CUCC6IHBUG UGCUGCUCCA UGUCAHACUC 7680 
CUGGACCGGG GCUCUAAUAA CUCCUUGUAG CCCCGAAGAG GAAAAGUIiGC CAAUtfGGCCC 7740 
CUUGAGCAAC UCCCUGUUGC GAUAUCACAA CAAGGUGUAC UGUACCACAU CAAAGAGCGC 7800 
CUCAUUAAGG GCUAAAAAGG JAACUUUBGft UAGGAUGCAA GCGCUCGACG CUCAUUAWGA 7860 
CUCAGIOUG AA6GACAUHA A6CUAGCGGC CUCCAAGGUC ACCGCAAGGC UUCifCACUUli 7020 
AGAGGAGGCC IHJCCAGUUAA COCCACCCCA CUC1TCCAAGA UCCAAGUAUG GGUUUGGGGC 7980 
UAAGGAGGUC CGCAGCUUGU CCGGGAGAGC CGUUAACCAC AUCAAGOCCG UGUGGAAGGA 8040 
CCUCClffiGAA GACACACAAA CACCAAUUCC UACAACCAUC AUGGCCAAAA AifGAGGUGtlt) 8100 
CUGCGU6GAC CCCACCAAGG GGGGUAAGAA AGCAGCUCGC CUUAUCGUUU ftCCCUGACCU 8160
CGGCGtfCAGG GUCUGCGAGA AAAUGGCCCU HUABGAUAUC ACAGAAAAGC UUCCUCAGGC 8220 
GGUGAUGGG6 GCUtfCUUAUG GAUUCCAGUA CUCCCCCGCtl CAGC6G6U6G AGMUCIJCUU 42*0 
GAAGGCAWGG GC6GAAAAGA AAGACCCIttU GGGHOUUIfCG HAUBAMCCC GAifGCUUUGA 8340 
CUCAACCGtfC ACUGAGAGAG ACAUCAGGAC UGAGGAGUCC AttAUAUCGGG CUUGUUCCUU 8400 
GCCCGAGQAG GCCCACACUG CCAUACACHC ACUGACUGAG AGACUUUACG UGGGAG6GCC 3460 
CAUGOUCAAC AGCAAGGGCC AGACCUGCGG GUACAGGCGU UGCCGCGCCA GCGGGGtfGCU 8520 
UACCACUAGC AUGGGGAACA CCAUCACAilG CUA1/GUGAAA GCCUUAGCGG CCUGUAAGGC 858G 
UGCAGfiGAUA AIWGCGCCCA CAAUGCUGGU AUGCGGCBAU GAClKfGGIfUG UCAUCIfCAGA 8640 
GAGCCAGGGG ACCGAGGAGG ACGAGCGGAA CCUGAGAGCC (IUCACGGAGG CUAlfGACCAG 4700 
GUAUtfCUGCC CCUCCDGGUG ACCCCCCCAG ACCGfiAAlM GACCUGGA6C OGAUAACAUC 6760 
UU6CUCCUCA AAlfGUGUCUG UGGCGUU6GG CCCACAAGGC CGCCGCAGAU ACUACCJGAC 8820 
CAGAGACCCU ACCACUCCAA UCGCCCGGGC UGCC08GGAA ACAGUUAGAC ACUCCCCUGU 3580 
CAAUUGAUGG CUAGGAAACA UCAUCCAGM CGCCCCAACC AUAUGGGCHC GCAUGGlfCCO 8940 
GAUGACACAC UUCUUCUCCA (WCIKAUGGC CCAAGAEJACU C9GGACCA6A ACCHCAACUU 9000 
UGAGAifGUAC GGAGCGGUGU ACUCCGUGAG UCCCtfUGGAC CUCCCAGCCA UAAUUGAAAG 9060 
GUUACACGGG CUffGACGCUU IfCUCUCMGCA CACAHACACU CCCCACGAAC UGACACGGQlt 9120 
GGCUUCAGCC CUCAGAAA/C UUQGGGCGCC ACCCCUCAGA GCGUGGAAGA GCCGGGCACG 9180 
D6CAGUCAGG GCGtfCCCUCA UCfJCCCGOGG GGGGAGAGCG GCCGUUUGCG GCCGAIWIO 9240 
CUdCAACUGG GCGGUGMGA CCAAGCUCAA ACIBACUCCA UVGCCGGAAG CGCGCCUCCU 9300 
GGAUDUAUCC AGCUGGUUCA CUGIKGGCGC CGGCGGG6GC GACAUUUABD ACAGCGUGUC 9360 
GCGtfGCCCGA CCCCGCUUAU UACUCCUUGG CCIMCUCCUA CtfUUUUGUAG GGGI/AG6CCU 9420 
UUUCCUACUC CCCGCUOGGU AGAGCGGCAC ACAUOAGCUA CACUCCAIMG CUAACtfGUCC 9480 
CUUUU Jl/UUU UUUUUUUUUU UUUUUtfUUUU UUUU9UUUUU UUUUUUIJUBJ UU««liUlflJUU 9540 
UUOUUUlfUUU UUUUUUUUUU UMUMUHUli UtiUUfllMUU UUUUMJUM 9589 
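
The genomic RNA listing above and the cDNA listing that follows appear to describe the same strain, differing only in alphabet (U in RNA, T in DNA). As a minimal sketch under that assumption, the two listings can be compared block by block to flag positions where they disagree. The function names and the short sample strings are hypothetical and are not taken from the patent.

```python
# Illustrative helpers, not part of the patent text.
# Assumption: the cDNA listing is the same-strand DNA counterpart of the RNA listing.


def rna_to_cdna_alphabet(rna: str) -> str:
    """Rewrite an RNA sequence in the DNA alphabet (U -> T), same strand."""
    return rna.upper().replace("U", "T")


def mismatches(rna: str, cdna: str) -> list[int]:
    """Return 1-based positions where the RNA and cDNA sequences disagree.

    Only the overlapping length of the two inputs is compared.
    """
    converted = rna_to_cdna_alphabet(rna)
    return [
        index + 1
        for index, (a, b) in enumerate(zip(converted, cdna.upper()))
        if a != b
    ]


# Hypothetical sample blocks (not from the listings).
sample_rna = "ACGUUGCA"
sample_cdna = "ACGTAGCA"
disagreements = mismatches(sample_rna, sample_cdna)
```

A non-empty result for a block marks positions worth checking against the original publication before relying on either listing.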
Molecule type: cDNA to genomic RNA

ACCCGCCCCT AATAGG66CG ACACTCCGCC ATGAACCACT CCCCTGTCAG GAACTACTGT 60 

CUCACGCAG AAAGCGTCTA GCCATGGCGT TAGFATGA6T GTCGTACAGC CTCCAGGCCC 120 

CCCCCTCCCG GGAGAGCCAT AGTSGTCTGC GGAACCGGTG AGTACACCGG AATTGCCGEG 180 

AAGACTG6GT CCTTTCTTGG A7AAACCCAC TCTATGCCCG GTCATTTGGG CGIfiCCCCCC 240 

CAAGACTGCT AGCCCAGTAG C6TTGG6TTG CGAAAGGCCF TGTGGTACTG CCTGATAGG6 300 

TGCTTGCGAG TGCCCCGGGA GGTCTCGTA6 ACCGTGCACC ATGAGCACAA ATCCTAAACC 360 

TCAAAGAAAA ACCAAAAGAA ACACCAACCG TCGCCCACAA GACGTTAAGT TTGCGGGCGG 420 

CGGCCAfiATC GTTGGCG6A6 TATACTTGTT GCCGCGCACG GGCCCCAGGT TGGGTG7GCG 480 

CGCGACAAGG AACflClICGG AGCGGTCCCA GCCACGT6GA AGGC6CCAGC CCATCCCTAA 540 

GGATCGGCSC TCCACTGGCA AATCCTGG6G AAAACCAGGA TACCCCTGGC CCCTATACGG 600 

GAATGAGGGA CTCGGCTGGG CAGGATGGCT CCTGTCCCCC CGA66TTCCC GTCCCTCTTG 660 

GGGCCCCAAT GACCCCCGGC ATA8GTCCCG CAACGTGGGT AAGGTCATCG ATACCCTAAC 720 

GTGCGGCrTT GCCGACCICA TGGGGTACAT CCCTGTCGfA GGCGCCCCGC TCGGCGGCGT 780 

CGCCAGA6CT CTCGCGCATG GCGIGAGAGT CCTGGAGGAC GGGGTTAATT TTGCAACAGG 840 

GAACTTACCC GGTTGCTCCT TTTCTATCTT CTTGCTGGCC CTGCTGTCCT GCATCACCAC 900 

CCCGGTCfCC GCTGCCGAAG TGAAGAACAT CAGTACCGGC TACATGGTGA CCAACGACTG 960 

CACCAATGAT AGCATTACCT GGCAACTCCA GGCTCCTCTC CTCCACGTCC CCGG6TGCGT 1020 

CCCGT6CGAG AAAGTGGG6A ATACATCTCG GTGCTGGATA CCG6TCTCAC CGAATGTGGC 1080 

CGTGCA6CAG CCC6GCGCCC TCACGCAGGG CTTACGGACG CACATTGACA TGGTTGTGAT 1140 

GTCCGCCACG CTCTGCTCCG CTCTTTACGT GGGGGACCTC TGCGGTGGGG TGATGCTTGC 1200 
AGCCCAGATG TTCATTGTCT CGCCACAGCA CCACTGGTTT GTGCAAGACT GCAATTGCTC 1260
CATCTACCCT GGTACCATCA CTGGACACCG CATGGCGTGG GACATGATGA TGAACTGGTC 1320
GCCCACGGCI ACCATGATCC TGGCGTACGC GAT6CGCGTC CCCGAGGTCA TCATAGACAT 1380
CATTG6CGGG GCTCATTGGG GCGTCATGTI CGGCTTAGCC TACTTC7CTA TGCAG6GAGC 1440
GTGGGCAAAA GTC6TTGTCA ttcttttgci GGCCGCCGGG GTGGACGCGC AAACCCATAC 1500
CGTTGGG6GT 7CTACCGCGC ATAACGCCAG GACCCTCACC GGCATGTTCT CCCTTGGTGC 1560
CASK AC AAA ATCCAGCTCA TCAACACCAA TGGCAGTTGG CACATCAACC GCACCGCCCT 1620
GAACTGCAAT GACTCTTTGC ACACCGGCTT CCTCGCGTCA CTGTTCTACA CCCACAGCTT 1680
CAACTCGTCA GGATSTCCCG AACGCATGTC CGCCTGCCGC AGTATCGA6G CCTTTCGGGT 1740
GGGATGGGGC GCCRACAAT ATGAGGACAA TGTCACCAAT CCAGAGGATA TGA6ACCGTA 1800
TT6CTGGCAC TACCCACCAA GACAGTGT6G TGTAGTCTCC GCGAGCTCTG TGTGTGGCCC 1860
AGTGTACTGT TTCACCCCCA 6CCCAGTAGT AGTGGGTACG ACCGATAGAC TTGGAGCGCC 1920
CACTTACACG TG6GGGGAGA ATGAGACAGA TGTCTTCCTA TT6AACAGCA CTCGACCACC 1980
GCA6GGGTCA TGGTTCGGCT GCACGTGGAT GAACTCCACT GGCTACACCA AGACTTGCGG 2040
CGCACCACCC TCCCGCATTA GAGCTGACTT CAATGCCAGC ATGGACTTGT TGTGCCCCAC 2100
GfiACTGTTTT AGGAAGCATC CTGATACCAC CTACATCAAA TGTGGCTCTG GGCCCTGGCT 2160
CACGCCAAGG T6CCTGATC6 ACTACCCCTA CAGGCfCTGG CATTACCCCT flCACAGTTAA 2220
CTATACCATC TTCAAAATAA GGAT6TATGT GGGG6GGGTC GAGCACAGGC TCACGGCTGC 2280
GTGCAATTTC ACTCGTGGGG ATCGTTGCAA CTTGGAGGAC AGAGACA6AA GTCAACTGTC 2340
TCCTTTGCTG CACTCCACCA CGGAGTGGGC CATTTTACCT TGCACTTACT CGGACCTGCC 2400
CGCCTTGTCG ACTGGTCTTC rCCACCTCCA CCAAAACATC GTGGACGTGC AATTCATGTA 2460
TGGCCTATCA CCTGCTCTCA CAAAATACAT GGTCCGAT36 6A6TG6GTAG TACTCTTATT 2520
CCT8CTCTTA GCGGACGCCA If Hit, 1 1 fllWJ A TfiTTPt TCT TflTTGGKPrH KlW/l vft IVI ivll utaal#l>H 2580
GGCCGAAGCA GCACTAGAGA AGTTGGTCGT CTTGCACGCT SCGAGCGCAG CTAGCTGCAA 2640
TGGCTTCCTA TACTTTGTCA TCTTTTTCGT GGCTGCTTGG TACATCAAGG 6TC666TAGT 2700
CCCCTT6GCT ACTTATTCCC TCACTGGCCT ATGGTCCTTT GGCCTACTGC TCCTAGCATT 2760
6CCCCAACAG fiCTTATGCTT ATGACGCATC TGTACATGGT CAGATAG6AG CAGCTCTGTT 2820
G6TACTGATC ACTCTCTTTA CACTCACCCC CGGGTATAAG ACCCTTCTCA GCCGGTTTCT 2880
GTG6TGGTTG TfiCTATCTTC TGACCCTGGC GGAAGCTATG GTCCAG6AST GGGCACCACC 2940
TATGCAGGTG CGCGGTGGCC GTGATGGGAT CATATGGGCC GTCGCCATAT TCTGCCC6GG 3000
TGT6GTGTTT GACATAACCA AGTGGCTCTT GGCG6TGCTT GGGCCTGCTT ATCTCCTAAA 3060 
AGGTfiCrTTG ACGCGTGTGC CGTACTTCST CAGGGCTCAC GCTCTACTAA GGATGTGCAC 3120 
CATfiGTAAGG CATCTCGC6G 6GGGTAGGTA CGTCCAGATG GTGCTACTAG CCCTTGGCAG 3180 
GTGGACTGGC ACTIACATCT ATGACCACCT CACCCCTATG TCGGATTGGG CTGCTAATGG 3240 
CCTGC6GGAC TTGGCGGTCG CCGTGGAGCC TATCATCTTC AGTCC6AT6G AGAAAAAAGT 3300 
CATCGTGTGG GGAGCGGAGA CAGCTGCTTG CGGGGATATC TTACACGGAC TTCCCGTGTC 3360 
CGCCCGACTT GGCCGGGAGG TCCTCCTTGG CCCA6CTGAT GGCrATACCT CCAAGGGGTG 3420 
GAGTCTTCTC GCCCCCATCA CTGCTTATGC CCAGCAGACA CGCGGCCTTT T86GCACCAT 3460 
AGTGGrGAGC AT6ACGGG6C 6CGACMGAC AGAACAGGCC GGGGAGATTC AGGTCCTGFC 3540 
CACfiGTGACT CAGTCCTTCC TCGGAACAAC CATCTCGGGG GTCfTATGGA CTGTCTACCA 3600 
TGGAGCTGGC AACAAGACTC TAGCCGGCTC ACG6GGTCCG GTCACACAGA TGTACTCCAG 3660 
TGCT6A6GGG GACTTAGT6G GGTGGCCCAG CCeCCCCGGG ACCAAATCTT TGGA6CCGTG 3720 
CACGTGTGGA GCGGTCGACC IATACCT6ST CACGCGAAAC 6CTGATGTCA TCCCGGCICG 3780 
AA6ACGCGGG GACAAGCGAG GAGCGCTACT CTCCCCGAGA CCTCTTTCCA CCTTGAAGGG 3840 
GTCCTCGGGG GGCCCGGTGC TCTGCCCCAG AGGCCACGCT GTCGGGGTCT TCCGG6CAGC 3900 
CGTGT6CTCC CGGGGCGT6G CCAAGTCCAT AGATNTATC CCCGTTGAGA CACTTGACAT 3960 
CGTCACFCGG TCCCCCACCT TTASTGACAA CAGCACACCA CCT6CTGTGC CCCAAACTTA 4020 
TCAGGfCGGG TACTTACATG CCCCGACTGG TAGTGGAAAG AGCACCAAAG TCCCTGTCGC 4080 
GTATGCCGCT CAGGGGTACA AAGTGCTAGT GCTTAATCCC TCGGTGGCTG CCACCCTGGG 4140 
GTTTGGGGCG TACTTGTCCA AGGCACATGG CATCAATCCC AACATTAG6A CTGGGfifCAG 4200 
GACTGTGACG ACCGGGGCGC CCATCACGFA CTCCACATAT GGCAAA1TCC TCGCCGATGG 4260 
GGGCTGCGCA GGCGGCGCCT ATGACATCAT CATATGCGAT GAATGCCATG CCGTGGACTC 4320 
TACCACCATT CTCGGCATCG GAACAGTCCT CGATCAAGCA GAGACAGCCG GGG1CAGGCT 4380 
AACTGTACTG GCTACGGCTA CGCCCCCCGG GTCAGTGACA ACCCCCCACC CCAACATAGA 4440 
GGAG6TGGCC CTCGGGCAGG AGGGTGAGAT CCCCTTCTAT GGGAGG6CGA TTCCCCTGTC 4500 
ATACATCAAG GGAGGAAGAC ACTTGATCTT CTGCCACTCA AAGAAAAA6T GTGACGAGCT 4560 
CGCGGCGGCC CTTCGGGGTA TGGGCTTGAA CGCAGTGGCA TACTACAGAG GGCTGGACGT 4620 
CTCCGTAATA CCAACTCA6G GAGACGTAGT GGTCGTCGCC ACCGACGCCC TCATGACGGG 4680 
G7TTACTGGA fiACTTTAACT CCGTGATCGA CTGCAACGTA GCSGTCACTC AAGTTGTAGA 4740
CTTCAGCTTG GACCCCACAT TCACCATAAC CACACAGACT GTCCCTCAAG ACGCTGTCTC 4800 
ACGTAGCCAG CGCCGGGGCC GCACGGGCAG GGGAAGACT6 GGTATTTArA GGTATCTTTC 4860 
CACTGGTfiAG C6AGCCTCAG GAATGTTTGA CAGT6TAGTG CTCTfiCGAGT GCTACGATGC 4920 
AflGGGCCGCA TGGTATGAGC TCACACCAGC GGA6ACCACC GTCAGGCTCA GAGCAFATTT 4960 
CAACACACCT GGTTTGCCTG 7GTGCCAAGA CCATCITGAC TFTTGGGAGG CAGTTTTCAC 5040 
CGGCCTCACA CACATAGATG CCCACTTCCT TTCCCAAACA AAGCAATCGG 6GGAAAATTT 5100 
CfiCATACTTA ACAGOCTACC AGGCTACAGT GTGCGCTAGG GCCAAAGCCC CCCCCCCGTC 5160 
CTGGGACGTC ATGTGGAAGT GTTTfiACTCG ACTCAA6CCC ACACTCGTGG GCCCCACACC 5220 
TCTCCTGTAC CGCTTGGGCT CTGTTACCAA CGAGGTCACC CTCACGCA7C CTGT6ACGAA 5280 
ATACATCGCC ACCTGCATGC AAGCCGACCT TGAGGTCATG ACCAGCACGT GGGTCrTAGC 5S40 
TGGGGfiGGTC TTGGCGGCCG TCGCCGCGTA CTGCCTGGCG ACCGGGTGT6 TTTGCATCAT 5400 
CGGCC6CTTG CACGTTAACC AGCGAGCCGT CGTTGCACCG GACAAGGAGG TCCTCTATGA 54SO 
GGCTTTTGAT GAGATGGAGG AATGTGCCTC TAGAGCSGCT CTCATTGAAG AGGGGCAGCC 5520 
GATAGCCGAG ATGCTGAAGT CCAAGATCCA AGGCTTATTG CAGCAAGCTT CCAAACAAGC 5580
TCAAGACATA CAACCCGCTG TGCAGGCTTC TTGGCCCAAG GTAGAGCAAT TCTGGGCCAA 5640
ACACATGTGG AACTTCATCA GCGGCATTCA ATACCTCGCA GGACTATCAA CACTGCCAGG 5700
GAACCCTGCT GTAGCTTCCA TGATGGCATT CAGTGCCGCC CTCACCAGTC CGTTGTCAAC 5760
TAGCACCACT ATCCTTCTCA ACATTTTGGG GGGCTGGCTA GCATCCCAAA TTGCGCCTCC 5820
CGCGGGGGCT ACCGGCTTCG TCGTCAGTGG CCTGGTGGGG GCTGCCGTAG GCAGCATAGG 5880
CTTGGGTAAG GTGCTGGTGG ACATCCTGGC AGGGTATGGT GCGGGCATTT CGGGGGCTCT 5940
CGTCGCATTC AAGATCATGT CTGGCGAGAA GCCCTCCATG GAGGATGTTG TCAACCTGCT 6000
GCCTGGAATT CTGTCTCCGG GTGCCCTGGT GGTGGGAGTC ATCTGCGCGG CCATCCTGCG 6060
CCGACACGTG GGACCGGGGG AAGGCGCTGT CCAATGGATG AATAGGCTCA TTGCCTTTGC 6120
TTCCAGAGGA AACCACGTCG CCCCCACCCA CTACGTGACG GAGTCGGATG CGTCGCAGCG 6180
TGTGACCCAA CTACTTGGCT CCCTTACCAT AACCAGCCTG CTCAGGAGAC TCCACAACTG 6240
GATTACTGAA GACTGCCCCA TCCCATGCAG CGGCTCGTGG CTCCGCGATG TGTGGGATTG 6300
GGTTTGCACC ATCCTAACAG ACTTTAAAAA CTGGCTGACC TCCAAATTGT TCCCAAAGAT 6360
GCCTGGTCTC CCCTTTATCT CTTGTCAAAA GGGGTACAAG GGCGTGTGGG CTGGCACTGG 6420 
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7A7CA7GACC ACACGGTGTC C77GC66C6C CAATflTCTCT flflCAATGrCC GCC7GGGC7C 6480 
CATCA6AATT ACGGGGCCCA AAACCTGCAT GAA7ATC7GG CAGGGGACCT TTCCCATCAA 6540 
TT6T7ACACG GAGGGCCAG7 GC67GCCGAA ACCCGCACCA AACT7TAAGA TCfiCCATCTG 6600 
GAGGGTGGCG GCCTCAGAGT ACGCG6AGGT GACSCAGCAC GGSTCATACC ACTACATAAC 6660 
AGGACTTACC ACFGATAACT TGAAAGTTCC TTGCCAACTA CCTTCTCCAG A6TTCTTTTC 6720 
CTGflGTGGAC CGAGTGCAGA TCCATAGGTT T6CCCCCATA CCGAAGCCG7 TT7TTCGGGA 6730 
TGAGGFCTCG TTCTGCGTTG GGC7TAATTC ATTTGTCGTC GGGTCTCAGC TCCCTTGCfiA 6*40 
TCCTGAACCT GACACAGAC6 TATTGACGTC CAT6CTAACA GftCCCATCCC A7A7CACG6C 6900 
GGAGAC7GCA fiCGCGGCGTT rGGCACGfiGG GTCACCCCCG 7CCGAGGCAA GCTCCTCA6C 6960 
CAGCCAGCTA TCGGCACCAT CGCTGCGAGC CACCTGCACC ACCCACGGCA AGGCCTATGA 7020 
7GT6GACA7G 6TGGATGCCA ACC7GT7CA7 GGGGGGCGA7 G75ACCCGGA TAGAGTCTGA 7080 
G7CCAAAG7G G7CG77C76G AC7C7C7CGA CCCAA7GGTC GAAGAAA6GA GC6ACC77GA 7140 
GCC77CGA7A CCA7CG6AA7 A7A7GC7CCC CAASAAGA6A TTCCCACCAG CC77ACCGGC 7200 
77GGGCACGG CCTGA7TACA ACCCACCGCT TGT86AA7CG TGSAAGAG6C CAGATTACCA 7260 
ACCGGCCAC7 6TT6CGGGCT 6CGC7C7CCC CCCCCC7AAG AAAACCCCGA CGCCTCCCCC 7320 
AAGGAGACGC CGGACAG7GG G7C7GAGTGA GAGCTCCA7A 6CAGA7GCCC 7ACAACAGCT 7340 
GGCCA7CAAG 7CC777GGCC AGCCCCCCCC AAGC66C6A7 TCAG6CC777 CCACGG6G6C 7440 
GGACGCAGCC 6A77CCGGCA G7C6GACGCC CCCCGA7GAG TTGGCCCTTT CGGAGACAGG 7540 
7TCCA7C7CC 7CCATGCCCC C7C7CGAG6S GGAGCC7G6A GA7CCAGAC7 TG6A6CCTGA 7560 
GCAGG7AGAG C77CAACC7C CCCCCCAGGG GGSGG7GGTA ACCCCCGGC7 CACGC7CGGG 7620 
G7C7TGG7C7 ACT7GC7CCG AGGAGGACGA C7CC67CG7G 7GCrGC7CCA 7G7CA7ACTC 7680 
C7GGACCGGG GCTCTAA7AA C7CC77GTAG CCCC6AAGAG GAAAAGTTOC CAATT66CCC 7740 
C7TGAGCAAC 7CCC7GnGC GA7A7CACAA CAAGGTG7AC 7G7ACCACA7 CAAA6AGCGC 7600 
C7CA77AAG6 GCTAAAAA6G 7AAC7777GA TAGGA7GCAA GCGC7CGACG C7CA77A7GA 7860 
C7CAG7C7TG AAGGACA77A AGCTAfiCGGC CTCCAAGG7C ACCGCAAGGC 77C7CACTTT 7920 
AGAGGAGGCC 76CCAG7TAA CTCCACCCCA C7C7GCAAGA 7CCAAGTA7G GG7T7GGGGC 7960 
7AAGGA4SG7C CGCAGC7TG7 CCGGGAGAGC CGT7AACCAC A7CAAG7CCG 7GTGGAAGGA 8040 
CC7CCTGGAA GACACACAAA CACCAA77CC 7ACAACCA7C A76GCCAAAA A7GAGG7G77 8100 
C7GCG7GGAC CCCACCAAGG GGGG7AAGAA AGCAGC7CGC CT7A7CGn7 ACCC7GACC7 8160 
CGGCGTCAGG GTCTGCGAGA AAATGGCCCT TTATGATATC ACACAAAAGC TTCCTCAGGC 8220
GGTGATGGGG GCTTCTTATG GATTCCAGTA CTCCCCCGCT CAGCGGGTGG AGTTTCTCTT 8280
GAAGGCATGG GCGGAAAAGA AAGACCCTAT GGGTTTTTCG TATGATACCC GATGCTTTGA 8340
CTCAACCGTC ACTGAGAGAG ACATCAGGAC TGAGGAGTCC ATATATCGGG CTTGTTCCTT 8400
GCCCGAGGAG GCCCACACTG CCATACACTC ACTGACTGAG AGACTTTACG TGGGAGGGCC 8460
CATGTTCAAC AGCAAGGGCC AGACCTGCGG GTACAGGCGT TGCCGCGCCA GCGGGGTGCT 8520
TACCACTAGC ATGGGGAACA CCATCACATG CTATGTGAAA GCCTTAGCGG CCTGTAAGGC 8580
TGCAGGGATA ATTGCGCCCA CAATGCTGGT ATGCGGCGAT GACTTGGTTG TCATCTCAGA 8640
GAGCCAGGGG ACCGAGGAGG ACGAGCGGAA CCTGAGAGCC TTCACGGAGG CTATGACCAG 8700
GTATTCTGCC CCTCCTGGTG ACCCCCCCAG ACCGGAATAT GACCTGGAGC TGATAACATC 8760
TTGCTCCTCA AATGTGTCTG TGGCGTTGGG CCCACAAGGC CGCCGCAGAT ACTACCTGAC 8820
CAGAGACCCT ACCACTCCAA TCGCCCGGGC TGCCTGGGAA ACAGTTAGAC ACTCCCCTGT 8880
CAATTCATGG CTAGGAAACA TCATCCAGTA CGCCCCAACC ATATGGGCTC GCATGGTCCT 8940
GATGACACAC TTCTTCTCCA TTCTCATGGC CCAAGATACT CTGGACCAGA ACCTCAACTT 9000
TGAGATGTAC GGAGCGGTGT ACTCCGTGAG TCCCTTGGAC CTCCCAGCCA TAATTGAAAG 9060
GTTACACGGG CTTGACGCTT TCTCTCTGCA CACATACACT CCCCACGAAC TGACACGGGT 9120
GGCTTCAGCC CTCAGAAAAC TTGGGGCGCC ACCCCTCAGA GCGTGGAAGA GCCGGGCACG 9180
TGCAGTCAGG GCGTCCCTCA TCTCCCGTGG GGGGAGAGCG GCCGTTTGCG GCCGATATCT 9240
CTTCAACTGG GCGGTGAAGA CCAAGCTCAA ACTCACTCCA TTGCCGGAAG CGCGCCTCCT 9300
GGATTCATCC AGCTGGTTCA CTGTCGGCGC CGGCGGGGGC GACATTTATC ACAGCGTGTC 9360
GCGTGCCCGA CCCCGCTTAT TACTCCTTGG CCTACTCCTA CTTTTTGTAG GGGTAGGCCT 9420
TTTCCTACTC CCCGCTCGGT AGAGCGGCAC ACATTAGCTA CACTCCATAG CTAACTGTCC 9480
CTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 9540
TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTT 9589
Sequence type: cDNA to genomic RNA

rar 

G6CATTACCC CTGCACAGTT AACTATACCA TCTTCAAAAT AAG6ATGTAT GTGfiGGGGGG 60 
TCGAGCACAG 6CTCACG6CI GCGTGCAATT TCACTCGTGG GGA7CGTTGC AACTTGGAGG 120 
ACAGAGACAG AAGTCAACTG TCTCCITTGC TGCACTCCAC CACGGAGTGG GCCATITTAC 180 
CTTGCACTTA CTCGflACCTG CCCflCCTTGT CGACTGGTCT TCTCCACCTC CACCAAMCA 240 
TCGTGGACGT GCAATTCATG fATGGCCTAT CACCTGCTCT CACAAAATAC ATC6TCCGAT 300 
GGGAGTGGG7 AGTACTCTTA TTCCTGCTCT TAGGGGACGC CASGGTTTGC GCCT6CTTAT 360 
fiGATGCTCAT CTrSTTGGGC CAGGCCGAAG CAGCACTAGA GAAGTTGGTC GTCTTGCACG 420 
CTGCGAGCGC AGCTAGCT6C AATGGCTTCC TATACTTTGT CATCTTTTTC GTGGCTGCTT 4*0 
GGTACATCAA GGGTCGGGTA GTCCCCTTGG CTACTIATTC CCTCACTGGC CTATGGTCCT 540, 
TTGGCCTACT GCTCCTAGCA TTGCCCCAAC AG&CTTATGC TTATGACGCA TCTGTACATG 600 
GTCAGATAGG AGCAGCTCTG rTGGTACTGA TCACTCTCTT TACACTCACC CCCGGGTATA 660 
AGACCCTTCT CAGCCGGTTT CTGTGGTGGT TGTGCTATCT TCTGACCCTG GCGGAAGCTA 720 
TGGTCCAGGA GTGGGCACCA CCTAT6CAGG TGCGCGGTGG CCGTGATGGG ATCATATGGG 7&0 
CCGTCGCCAT ATTCTGCCCG GGFGTGGTGT TTGACATAAC CAAGTGGCTC TTGGCGGIGC 840 
TTGGGCCTGC TTATCTCCTA AAAGGTGCTT TGACGCGTGT GCCGTACTTC GTCAGGGCTC 900 
ACGCTCTAC7 AA6GATGTGC ACCATGGTAA GGCATCTCGC GGGGGGTAG6 TACGTCCAGA 960 
TGGTGCTACT AGCCCTTGGC AGGTGGACTG GCACTTACAT CTATGACCAC CTCACCCCTA 1020 
TGTCG6ATTG GGCTGCTAAT GGCCTGCGGG ACTTGGCGGT CGCCGTGGAG CCTATCATCT 1080 
TCAGTCCGAT GGAGAAAAAA 6TCATCGTCT GGGGAGCGGA GACAGCTGCT TGCGGGGAIA 1 140 
TCTTACACGG ACTTCCCGTG TCCGCCCGAC TTCGCCGGGA GGTCCTCCTT GGCCCA6CTG 1200 
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AT6GCTATAC CTCCAAGGGG TGGAGTCTTC TCflCCCCCAT CACTGCTTAT fiCCCASCAGA 1260 
CACGCGGCCT TTTGGGCACC ATAGTGGTGA GCATGACGGG GCGCGACAAG ACAGAACAGG 1320 
CCGGGGAGAT TCAGGTOCTG TCCACGGTCA CTCAGTCCTT CCTCGGAACA ACCATCTCG6 13*0 
GGGTCTTATG GACTGTCTAC CATGGAGCTG GCAACAAGAC TCIAGCCGGC TCACGGGGTC 1440 
CGGTCACACA GATGTACTCC AGTGCTGAGG GGfiACTTAGT GGGGTGGCCC AGCCCCCCCG 1500 
GGACCAAATC TTTGGAGCCG TGCACGTGTG GAGCfifiTCGA CCTATACCTG GTCACGCGAA 1560 
ACGCTGATCT CATCCCGGCT CGAAGACGCG GGGACAAGCG AGGAGCGCTA CTCTCCCCGA 1625 
GACCTCTTTC CACCTTSAAG GGGTCCTCGG GGGfiCCCGGT GCTCTGCCCC AGA6GCCACG 1680 
CTGTCGGGGT CTTCCQGGCA GCCGTGTGCT CCCG6GGC6T GGCCAAGTCC ATAGATTTTA 1740 
TCCCCGTTGA GACACTTGAC ATCGTCACTC GfiTCCCCCAC CTTTAGTGAC AACAGCACAC 1800 
CACCTGCTGT CCCCCAAACT FATCAfiGTCG GGTACTTACA TGCCCCGACT GGTAGTGGAA 1*60 
AGAGCACCAA AGTCCCTflTC GCGTATGCCG CTCA6GGGTA CAAAGTGCTA GTGCTTAATC 1920 
CCTCGGTGGC TGCCACCCTG GGGTTTGGGG CGTACTTGTC CAAGGCACAT GGCATCAATC 1980 
CCAACATTAG GACTGGGGTC AGGACTGTGA CGACCGGGGC GCCCATCACG TACTCCACAT 2040 
ATGGCAAATT CCTCGCCGAT GGGGGCTGCG CACGCGGCGC CTATGACATC ATCATATGC6 2100 
ATGAATGCCA TGCCGTGGAC TCTACCACCA TTCTCGGCAT CSGAACAGTC CTCGATCAAG 2160 
CAGAGACAGC CGGGGTCAGG CTAACTGTAC TGGCTACGGC TACGCCCCCC GGGTCAGTGA 2220 
CAACCCCCCA CCCCAACATA GAGGA6GTGG CCCTCGGGCA GGAGfiGTGAG ATCCCCTTCT 2280 
ATOGGAGGGC GATTCCCCTG FCATACATCA AGGGAGfiAAG ACACTTGATC TTCTGCCACT 2340 
CAAAGAAAAA GTGTGACGAG CTCGCGGCGG CCCTTCGGGG TATGGGCTTG AACGCAGTGG 240O 
CATACTACAG AGGGC7GGAC GTCTCCGTAA TACCAACTCA GGGAGACGTA GTGGTCGTCG 2460 
CCACCGACGC CCTCATGACG GGGTTTACTG GASACTTTGA CTCCGTGATC GACTGCAACG 2520 
TAGCGGTCAC TCAAGTTGTA GACTTCAGCT TGGACCCCAC ATTCACCATA ACCACACAGA 2580 
CTGTCCCTCA AGACGCTGTC TCACGTAGCC AGCGCCGGGG CCGCACGGGC AGGGGAAGAC 2640 
TGGGTATTTA TAGGTATGTT TCCACTGGTG AGCGAGCCTC AGGAflTGTTT GACAGTGTAG 27H0 
TGCTCTGCGA GTGCTACGAT GCAGGGGCCG CATGGTATGA GCTCACACCA GCGGAGACCA 2760 
CCGTCAGGCT CAGAGCATAT TTCAACACAC CTGGTTT6CC TGTGTGCCAA GACCATCTTG 2620 
A6TTTTGGGA GCA6TTTTC ACCGGCCTCA CACACATAGA TGCCCACTTC CTTTCCCAAA 2880 
CAAAGCAATC GGGGGAAAAT TTCGCATACT TAACAGCCTA CCAGGCTACA GTGTGCGCTA 2940 
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GGGCCAAAGC CCCCCCCCCG TCCTGGGACG TCATGTGGAA GTGTTTGACT CGACTCAAGC 30(H) 
CCACACTCGT GGGCCCCACA CCTCTCCTGT ACCGCTTGGG CTCTGTTACC AACGAGGTCA 3060 
CCCTCACGCA TCCTGTGACG AAATACATCG CCACCTGCAT GCAAGCCGAC CTTGA6GTCA 3120 
TGACCAGCAC GTGGGTCTTA GCTGGGGGGG TCTTGGCGGC CGTCGCCGCG TACTGCCTGG 3180 
CGACCGGGTG TGTTTGCATC ATCGGCCGCT TGCAC6TTAA CCAGCGAGCC GTCGTTGCAC 3240 
CGGACAAGGA GGTCCTCTAT GAGGCTTTTG ATGAGATGGA GGAATGTCCC TCTAGAGCGG 3300 
CTCTCATTGA AGAGGGGCAG CGGATAGCCG AGATGCTGAA GTCCAAGATC CAAGGCTTAT 3360 
T6CAGCAAGC TTCCAAACAA GCTCAAGACA TACAACCCGC TGTGCAGGCT TCTTGGCCCA 3420 
AGGTAGAGCA A7TCTGGGCC AAACACATfiT GGAACTTCAT CAfiCGGCATT CAATACCTCG 3480 
CAGGACTATC AACACTGCCA GGGAACCCTG CTGTAGCTTC CATGATGGCA TTCAGTGCCG 3540 
CCCTCACCAC TCCGTTGTCA ACTAGCACCA CTATCCTTCT CAACATTTTG GGGGGCTGGC 3600 
TAGCATCCGA AATTGCGCCT CCCGCGGGGG CTACCGGCTT CGTCGTCAGT GGCCTGGTGG 3660 
GGGCT6CCGT AGGCAGCATA GGCTTfiGGTA AGGTGCTGGT GGACATCCTfl GCAGGGTATG 3720 
GTGCGGGCAT ITCGGGGGCT CTCGTCGCAT TCAAGATCAT GTCTGGC6AG AAGCCCTCCA 3780 
TGGAG6ATGT TGTCAACCTG CTGCCTGGAA TTCTGTCTCC GGGTGCCCTG GTGGTG6GAG 3840 
TCATCTGCGC GGCCATCCTG CGCCGACACG TGGGACCGGG GGAAGGCGCT GTCCAATGGA 3900 
TGAATAGGCT CATTGCCTTT GCTTCCAGAG GAAACCACGT CGCCCCCACC CACTACGTGA 3960 
CGGAGTCGGA 3970 
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ATTCTGTCTC CGGGTGCCCT GGTGGTGGfiA GTCATCTGCG CGGCCATCCT GCGCCGACAC 60 
GTG6GACCGG G6GAAGGCGC TGTCCAATGG ATGAATAGGC TCATTGCCH TGCTTCCAGA 120 

GfiAAACCACG TCGCCCCCAC CCACTACGTG AC6SAGTC6G AIGCGTCGCA GCGTGTGACC 180 
CAACfACTTG fiCTCCCTTAC CATAACCAGC CTGCICAGGA GACFCCACAA CTGGAFTACT 240 

CAAGACTGCC CCATCCCATG CAGCGGCTCG TGGCTCCGCG ATGTGTGfifiA TTOfifiTTrfiC 300 

ACCATCCTAA CAGACTTTAA AAACTGGCFG ACCTCCAAAT TGTTCCCAAA GATGCCTGGT 360 

CTCCCCTTTA TCTCTTGTCA AAAGGG6TAC AAGGGCGTGT GGGCTGGCAC TGGTATCATG 420 

ACCACACG6T GTCCTTGCGG CGCCAATAtC TCT6GCAATG TCCGCCTGGG CTCCATGAGA 480 

ATTACGGflfiC CCAAAACCTG CAT6AATATC TGGCAGGGGA CCTITCCCAT CAATTGTTAC MO 

AC6GAGS6CC AfiTGCGTGCC GAAACCCGCA GGAAACTTTA AGATCGCCAT CTfiCAGGGTG 600 

fiCGGCCTCAG AGTACGCGGA GGIGACGCAG CACGGGTCAT ACCACTACAT AACAGGACTT 660 

ACCACFGATA ACTTGAAAGT TCCTTGCCAA CTACCTTCTC CAGAGTTCTT TTCCT6GGTG 720 

GACGGAGT6C AGAfCCATAG fiTTTGCCCCC ATACC6AAGC CGTTTTTTCG GGATGAGGTC 780 

TCGTTCTGCG TTGGGCTTAA TTCATTTGTC GTCGGGTCTC AGCTCCCTTG CGATCCTGAA «40 

CCTGACACAG AC6TATTGAC GTCCATGCTA ACAGACCCAT CCCATATCAC GGCGGAGACT 900 

GCAGCGCGGC GTTTGGCACG GGGGTCACCC CCGTGCGAGG CAAGCTCCTC AGCGAGCCAG 960 

CTATCfiGCAC CATCGCTGCG AGCCACCTGC ACCACCCACG GCAAGGCCTA TGATGT6GAC 1020 

ATGGT6GAT6 CCAACCTGTT CArQGGGGGC GATGT6ACCC GGATAGAGTC TGAGTCCAAA 1060 

GTGGTCGTTC TGGACTCTCT CGACCCAATG GTCGAAGAAA GGAGCGACCT TGAGCCTTCG 1140 

ATACCATCGG AATATATGCT CCCCAAGAAG AGATTCCCAC CAGCCTTACC GGCTTGGGCA 1200 
CGGCCTGATT ACAACCCACC GCTTGTGGAA TCGTGGAAGA GGCCAGATTA CCAACCGGCC 1260
ACTGTTGCGG GCTGCGCTCT CCCCCCCCCT AAGAAAACCC CGACGCCTCC CCCAAGGAGA 1320
CGCCGGACAG TGGGTCTGAG TGAGAGCTCC ATAGCAGATG CCCTACAACA GCTGGCCATC 1380
AAGTCCTTTG GCCAGCCCCC CCCAAGCGGC GATTCAGGCC TTTCCACGGG GGCGGACGCA 1440
GCCGATTCCG GCAGTCGGAC GCCCCCCGAT GAGTTGGCCC TTTCGGAGAC AGGTTCCATC 1500
TCCTCCATGC CCCCTCTCGA GGGGGAGCCT GGAGATCGAG ACTTGGAGCC TGAGCAGGTA 1560
GAGCTTCAAC CTCCCCCCCA GGGGGGGGTG GTAACCCCCG GCTCAGGCTC GGGGTCTTGG 1620
TCTACTTGCT CCGAGGAGGA CGACTCCGTC GTGTGCTGCT CCATGTCATA CTCCTGGACC 1680
GGGGCTCTAA TAACTCCTTG TAGCCCCGAA GAGGAAAAGT TGCCAATTGG CCCCTTGAGC 1740
AACTCCCTGT TGCGATATCA CAACAAGGTG TACTGTACCA CATCAAAGAG CGCCTCATTA 1800
AGGGCTAAAA AGGTAACTTT TGATAGGATG CAAGCGCTCG ACCCTCATTA TGACTCAGTC 1860
TTGAAGGACA TTAAGCTAGC GGCCTCCAAG GTCACCGCAA GGCTTCTCAC TTTAGAGGAG 1920
GCCTGCCAGT TAACTCCACC CCACTCTGCA AGATCCAAGT ATGGGTTTGG GGCTAAGGAG 1980
GTCCGCAGCT TGTCCGGGAG AGCCGTTAAC CACATCAAGT CCGTGTGGAA GGACCTCCTG 2040
GAAGACACAC AAACACCAAT TCCTACAACC ATCATGGCCA AAAATGAGGT GTTCTGCGTG 2100
GACCCCACCA AGGGGGGTAA GAAAGCAGCT CGCCTTATCG TTTACCCTGA CCTCGGCGTC 2160
AGGGTCTGCG AGAAAATGGC CCTTTATGAT ATCACACAAA AGCTTCCTCA GGCGGTGATG 2220
GGGGCTTCTT ATGGATTCCA GTACTCCCCC GCTCAGCGGG TGGAGTTTCT CTTGAAGGCA 2280
TGGGCGGAAA AGAAAGACCC TATGGGTTTT TCGTATGATA CCCGATGCTT TGACTCAACC 2340
GTCACTGAGA GAGACATCAG GACTGAGGAG TCCATATATC GGCCTTGTTC CTTGCCCGAG 2400
GAGGCCCACA CTGCCATACA CTCACTGACT GAGAGACTTT ACGTGGGAGG GCCCATGTTC 2460
AACAGCAAGG GCCAGACCTG CGGGTACAGG CGTTGCCGCG CCAGCGGGGT GCTTACCACT 2520
AGCATGGGGA ACACCATCAC ATGCTATGTG AAAGCCTTAG CGGCCTGTAA GGCTGCAGGG 2580
ATAATTGCGC CCACAATGCT GGTATGCGGC GATGACTTGG TTGTCATCTC AGAGAGCCAG 2640
GGGACCGAGG AGGACGAGCG GAACCTGAGA GCCTTCACGG AGGCTATGAC CAG 2693
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr
5 10 15

Asn Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile
20 25 30

Val Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly
35 40 45

Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly
50 55 60

Arg Arg Gln Pro Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser
65 70 75

Trp Gly Lys Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
80 85 90

Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro
95 100 105

Ser Trp Gly Pro Asn Asp Pro Arg His Arg Ser Arg Asn Val Gly
110 115 120

Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met Gly
125 130 135

Tyr Ile Pro Val Val Gly Ala Pro Leu Gly Gly Val Ala Arg Ala
140 145 150

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Phe Ala
155 160 165
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Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala 

170 1 75 180 

Leu Lev ser cys He Thr Thr Pro Val Ser Ala Ala Glu Val Lys 

135 190 195 

Asn He Ser Thr Gly Tyr Het Val Thr Asn Asp Cys Thr Asn Asp 

200 205 210 

Ser (le Thr Trp Gin Leu Gin Ala Ala val Leu fi Is Val Pro Gly 

215 220 225 

Cys Val Pro cys Glu Lys Val Gly Asn Thr Ser Am Cys TrD He 

230 235 240 

Pro val Ser Pro Asn Val Ala Val Gin Gin Pro Gly Ala Leu Thr 

245 250 255 

Gin Gly Leu Arg Thr His He Asp Met Val Val Het Ser Ala Thr 

260 2fi5 270 

Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Het 

275 260 285 

Leu Ala Ala Gin Het Phe He Val Ser Pro Gin His His Trp Phe 

290 295 300 

Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly Thr He Thr Gly 

305 310 315 

Wis Arg Het Ala Trp Asp Het Het Het Asn Trp Ser Pro Thr Ala 

320 325 330 

Thr Het He Leu Ala Tyr Ala Het Arg Val Pro Glu val Ha He 

335 340 345 

Asp He He Gly Gly Ala His Trp Gly Val Het Phe Gly Leu Ala 

350 355 360 

Tyr Phe Ser Het Gin Gly Ala Trp Ala Lys Val Val vat lie Leu 

365 370 375 

Leu Leu Ala Ala Gly Val Asp Ala Gin Thr His Thr Val Gfy Gly 
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380 385 390 

Ser Thr Ala His Asn Ala Arc Thr Leu Thr Gly Het Phe Ser Leu 

395 400 405 

Gly Ala Arg Girt Lys lie Gin Leu Me Asn Thr Asn Gly Ser Trp 

410 415 420 

His lie Asn Arg fhr Ala Leu Asn Cys Asn Asp Ser Leu His Tfir 

425 430 435 

Gly Phe Leu Ala Ser Leu Phe Tyr Thr His Ser Phe Asn Ser Ser 

440 445 450 

Gly Cys Pro Glu Arg Het Ser Ala Cys Arg Ser He Glu Ala Phe 

455 460 465 

Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn Val Thr Asn 

470 475 460 

Pro Glu Asp Het Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin 

485 490 . 495 

Cys Gly Val Val Ser Ala Ser Ser Val Cys Gly Pro Val Tyr Cys 

500 505 510 

Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Lea Gly 

515 520 525 

Ala Pro Thr Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu 

530 535 540 

Leu Asn Ser Thr Arg Pro Pro Gin Gly Ser Trp Phe Gly Cys Thr 

545 550 555 

Trp Het Asn Ser Thr Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro 

560 565 570 

Cys Arg He Arg Ala Asp Phe Asn Ala Ser Het Asp Leu Leu Cys 

5?5 580 585 

Pro Thr Asp Cys Phe Arg Lys His Pro Asp Thr Thr Tyr He Lys 

590 595 600 
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Cys Sly Ser Gly Pro Trp Leu Bp Pro Arg Cys Leu He Asp Tyr 

605 610 615 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asa Tyr Tltr Me 

620 625 630 

Phe Lys He Arg Het Tyr Val fily Giy Val Glu His Arg Leu Thr 

635 640 645 

Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg cys Asa Leu Glu Asp 

650 655 660 

Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser Thr Thr Glu 

665 670 675 

Trp Ala He Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser 

m 665 690 

Thr Gly Leu Leu His Leu His Gin Asn He Val ASP Val Gin Phe 

6% 700 705 

Het Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr He val Arg Trp 

710 715 720 

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val 

725 730 735 

Cys Ala Cys Leu Trp Het Leu lie Leu Leu Gly Gin Ala Glu Ala 

740 745 750 

Ala Leu Glu Lys. Leu Val Val Lea His Ala Ala Ser Ala Ala Ser 

755 760 765 

Cys Asn Gly Phe Leu Tyr Phe Val He Phe Phe Val Ala Ala Trp 

770 775 780 

Tyr He Lys Gly Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr 

785 790 795 

Gly Leu Trp Ser Phe Gly Leu Leu Leu Leu Ala Leu Pro Gin Gin 

800 605 810 

Ala Tyr Ala Tyr Asp Ala Ser Val His Gly Gin He Gly Ala Ala 
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SIS 820 $25 

Leu Leu Val Leu lie Thr Leu Phe Thr Leu Thr Pro Gly Tyr Lys 
830 835 840 

Thr Leu Leu Ser Ara Phe Leu Trp Trp Leu Cys Tyr Leu Leu Thr 
845 850 555 

Leu Ala Gfu Ala Met val Gin Glu Trp Ala Pro Pro Met Gin Val 
m 865 470 

Arg Gly Gly Arg Asp Gly He He Trp Ala Val Ala He Phe Cys 
875 m m 

Pro Gly Val Val Phe Asp He Thr Lys Tn> Leu Leu Ala Val Leu 
m m soo 

Gly Pro Ala Tyr Leu Leu Lys Gly Ala Leu Thr Arg Val Pro Tyr 
905 910 915 

Phe Val Arg Ala His Ala Leu Leu Arg Het Cys Thr Het Val Ara 
920 925 930 

His Leu flla Gly Gly Ara Tyr Val Gin Met Val Leu Leu Ala Leu 
935 940 945 

Gly Arg Trp Thr Gly Thr Tyr He Tyr Asp His Leu Thr Pro Het 
950 955 960 

Ser Asp Trp Ala Ala Asfl Gly Leu Arg Asp Leu Ala Val Ala Val 
965 970 975 

Glu Pro He He Phe Ser Pro Het Glu Lys Lys Val He Val Trp 
980 985 990 

Gly Ala Glu Thr Ala Ala cys Gly Asp He Leu His Gly Leu Pro 
995 1000 1005 

Val Ser Ala Arg Leu Gly Ara Glu Val Leu Leu Gly Pro Ala Asp 
1010 1015 1020 

Gly Tyr Thr Ser Lys Gly Trp ser Leu Leu Ala Pro He Thr Ala 
1025. 1030 1035 
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Tyr Ala Gin Gin Thr Arg fify leu Leu Gly Thr He Val Val Ser 

1040 1045 1050 

Met Thr Gly Arg fist) lys Thr Glu Gin Ala Gly Glu lie Glu Val 

1055 1060 1065 

Leu Ser Thr Val Thr Gin Ser Phe Leu Gly Thr Thr He Ser Gly 

1070 1075 10SO 

Val Leu Trp Thr Val Tyr His fily Ala Gly Asn Lys Thr Leu Ala 

10*5 1090 1095 

Gly Ser Arg Gly Pro Val Thr Gin Het Tyr Ser Ser Ala Glu Gly 

1100 1105 1110 

Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu 6 lu 

1115 1120 1125 

Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn 

1130 1135 1140 

Ala Asp val lie Pro Ala Arc Arg Arc Gly Asp Lys Arg Gly Ala 

1145 1150 1155 

Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser Ser Qly 

1160 1165 1170 

Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Val Phe Arg

1175 1180 1185

Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile

1190 1195 1200

Pro Val Glu Thr Leu Asp Ile Val Thr Arg Ser Pro Thr Phe Ser

1205 1210 1215

Asp Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly

1220 1225 1230

Tyr Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro

1235 1240 1245

Val Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro

1250 1255 1260

Ser val Ala Ala Tnr Leu Gly Phe eiv Ala Tyr Leu ser Lys Afa 
1265 1270 12 75 

His Gly He Asn Pro Asn He Arg Thr Gly Val Arg Thr Val Thr 
1250 1285 1290 

Thr Gly Ala Pro lie Thr Tyr Ser Thr Tyr Gly lys Phe Leu Ala 
1295 1300 1305 

Asp Sly Gly Cys Afa fily Gly Ala Tyr Asp He He He Cys Asp 
1310 1315 1320 

Glu Cys His Ala Val Asp Ser Thr Thr He Leu Gly He Gly Thr 
1325 1330 1335 

Val La' Asp Gin Ala Glu Thr Ala Gly Val Arfl Leu Thr Val Leu 
1340 1345 1350 

Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn 
1355 1360 1365 

He Glu Glu Val Ala leu Gly Gin Glu Gly Glu He Pro Phe Tyr 
1370 1375 13S0 

Gly Arg Ala He Pro leu Ser Tyr He Lys Gly Gly Arg His Leu 
1335 1390 1395 

He Phe cys His Ser lys lys Lys Cys Asp Glu Leu Ala Ala Ala 
1400 1405 1410 

Leu Arg Gly Het Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu 
1415 1420 1425 

Asp val ser val He Pro Thr Gin Gly Asp Val val Val val Ala 
1430 1435 1440 

Thr Asp Ala Leu Het Thr Gly Phe Thr Gly Asp Phe Asp Ser Val 
1445 1450 1455 

He Asp Cys Asn Val Ala Val Thr Gin Val Val Asp Phe Ser Leu 
1460 1465 1470 

Asp Pro Thr Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala

1475 1480 1485

Val Ser Arg Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu

1490 1495 1500

Gly Ile Tyr Arg Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met

1505 1510 1515

Phe Asp Ser Val Val Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala

1520 1525 1530

Trp Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala

1535 1540 1545

Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gln Asp His Leu Glu

1550 1555 1560

Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His Ile Asp Ala His

1565 1570 1575

Phe Leu Ser Gln Thr Lys Gln Ser Gly Glu Asn Phe Ala Tyr Leu

1580 1585 1590

Thr Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro

1595 1600 1605

Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr Arg Leu Lys Pro

1610 1615 1620

Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ser Val

1625 1630 1635

Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr Ile Ala

1640 1645 1650

Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val

1655 1660 1665

Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala

1670 1675 1680

Thr Gly Cys Val Cys Ile Ile Gly Arg Leu His Val Asn Gln Arg

1685 1690 1695

Ala Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp

1700 1705 1710

Glu Met Glu Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly

1715 1720 1725

Gln Arg Ile Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu

1730 1735 1740

Gln Gln Ala Ser Lys Gln Ala Gln Asp Ile Gln Pro Ala Val Gln

1745 1750 1755

Ala Ser Trp Pro Lys Val Glu Gln Phe Trp Ala Lys His Met Trp

1760 1765 1770

Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu

1775 1780 1785

Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe Ser Ala Ala

1790 1795 1800

Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu Asn Ile
1805 1810 1815

Leu Gly Gly Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly Ala
1820 1825 1830

Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser
1835 1840 1845

Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly

1850 1855 1860

Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly

1865 1870 1875

Glu Lys Pro Ser Met Glu Asp Val Val Asn Leu Leu Pro Gly Ile

1880 1885 1890

Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile

1895 1900 1905
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Leu Arg Arg His Val Gly Pro Giy Glu Gty Ala Val Gin Trp Met 

1910 1915 1920 

Ash Arg Leu He Ala Plw Ala Ser Arg Gly Asn His Yal Ala Pro 

1925 1930 1935 

Tfir His Tyr Vai Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin 

1940 1945 1950 

Leu Leu Gly Ser Leu rhr He Thr Ser Leu Leu Arg Arg Leu His 

1955 I960 1965 

Asn Trp lie Thr Glu Asp Cys Pro He Pro Cys Ser Gly Ser Trp 

1970 1975 1980 

Leu Arg Asp Val Trp Asp Trp Val Cys Thr lie Leu Thr Asp Phe 

19-85 1990 1995 

Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro Lys Met Pro Gly Leu 

2000 2005 2010 

Pro Phe He Ser Cys Gin Lys Gly Tyr Lys Gly Val Trp Ala Gly 

2015 2020 2025 

Thr Gly He Het Thr Thr Arg Cys Pro Cys Gly Ala Asn He Ser 

2030 203S 2040 

Gly Asu Val Arg Leu Gly Ser Het Arg He thr Gly Pro Lys Thr 

2045 2050 2055 

Cys Met Asn He Trp Gin Gly Thr Phe Pro He Asn Cys Tyr Thr 

206Q 2065 2070 

Glu Gly Gin Cys Val Pro Lys Pro Ala Pro Asn Phe Lys He Ala 

20T5 2080 2085 

He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gin His 

2090 2095 2100 

Gly Ser Tyr His Tyr lie Thr Gly Leu Thr Thr Asp Asn Lei Lys 

2105 2110 2115 

Val Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp 
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2120 2125 2130 

Gly Vai Gin lie His Arg Plw Ala Pro lie Pro Lys Pro Phe Phe 

2135 2140 2145 

Arg Asp Glu Val Ser Phe cys val Gly Leu Asn Ser Phe Val Val 

2150 2155 2160 

Gly Ser Gin Leu Pro Cys Asp Pro Glu Pro Asp Tbr Asp Val Leu 

2 1 65 2170 2175 

Thr Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Thr Ala 

2180 2185 2190 

Ala Arg Are Leu Ala Am 6Iy Ser Pro Pro Ser Glu Ala Ser Ser 

2105 2200 2205 

Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr 

2210 2215 2220 

Thr His Gly Lys Ala Tyr Asp Val Asp Het Val Asp Ala Asn Leu 

2225 2230 2235 

Phe Het Gly Gly Asp Val Thr Arg He Glu ser Glu Ser Lys Val 

2240 2245 2250 

Val Vai Leu Asp Ser Leu Asp Pro Het Vai Glu Glu Arg Ser Asp 

2255 2260 2265 

Leu Glu Pro ser He Pro Ser Glu Tyr Het Leu Pro Lys Lys Arg 

2270 2275 2280 

Phe Pro Pro Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr Asn Pro 

2285 2200 2295 

Pro Leu val Glu ser Trp Lys Arg Pro Asp Tyr Gin Pro Ala Thr 

2300 2305 2310 

Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Thr Pro Thr Pro 

2315 2320 2325 

Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Ser He 

2330 2335 2340 
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Ala Aso Ala Ui Sin Gin Leu Ala He Lys Ser Phe GIv Gin Pro 

2345 2350 2355 

Pro Pro Ser Gty Asp Ser GJy leu Ser Thr Gly Ala Asp Ala Ala 

2360 2365 2370 

Asp Ser Clly Ser Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser Glu 

2375 2380 2335 

Thr Gly ser lie Ser ser Het Pro Pro Lett Glu Gly Glu Pro Gly 

2330 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gin Vat Glu Leu Gift Pro Pro Pro 

2405 2410 2415 

Gin Gly Gly Val Val Thr Pro Gly Ser Gly Ser Gly ser Trp Ser 

2420 2425 2430 

Thr Cys Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Het Ser 

2435 2440 2445 

Tvr Ser Trp Thr Gly Ala Leu lie Thr pro Cys Ser Pro Glu Glu 

2450 2455 2460 

Glu Lys Leu Pro lie Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr 

2465 2470 2475 

His Asn Lys Val Tyr Cys Thr Thr Ser Lys Ser Ala Ser Leu Arg 

2480 2435 2490 

Ala Lys Lys Val Thr Phe Asp Arg Het Gin Ala Leu Asp Ala His 

2495 2500 2505 

Tyr Asp Ser Vai Leu Lys Asp He lys Leu Ala Ala Ser Lys Val 

2510 2515 2520 

Thr Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gin Leu Tltr Pro 

2525 2530 2535 

Pre His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu Val 

2540 2545 2550 
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Arg Sen Leu Ser Gly Ar« Ala Val Asn His He Lys Ser Vaf Trp 

2555 2566 2565 

Lys Asp Leu Leu Glu Asp Thr Gin Thr Pro He Pro Thr Thr fie 

2570 2575 2580 

Het Ala Lys Asn Glu Val Phe Cys Val Asp Pro Thr Lys fiiy Gly 

2585 2590 2595 

Lys Lys Ala Ala Arg Leu He Val Tyr Pro Asp Leu Gly Val Arg 

2600 2605 2610 

Val Cys Glu Lys Het Ala Leu Tyr Asp He Thr Gin Lys Leu Pro 

2615 2620 2625 

Gin Ala Val Het Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 

2630 2635 2640 

Gin Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp 

2645 2656 2655 

Pro Het Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val 

2660 2665 2670 

Thr Glu Arg Asp lie Arg Thr Glu Glu Ser He Tyr Arg Ala Cys 

2675 2660 2685 

Ser Leu Pro Glu Glu Ala His Thr Aia lie His ser Leu Thr Glu 

2690 2695 2700 

Arg Leu Tyr Val Gly Gly Pro Het Phe Asn Ser Lys Gly flln Thr 

2705 2710 2715 

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser 

2720 2725 2730 

Hot Gly Asn Thr He Thr Cys Tyr Val Lys Ala Leu Ala Ala Cys 

2735 2740 2745 

Lys Ala Ala Gly He He Ala Pro Thr Het Leu Val Cys Gly Asp 

2750 2755 2760 
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Asp Leu Vat Val lie Ser Glu Sep Gin Gly Thr Glu 6iu Asp Glu 

2765 2770 2775 
Arg Asn Leu Arg Ala Phe Thr filu Ala Het Thr Arg Tvr Ser Aia 

2780 2785 2790 

Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu He 

2795 2800 2405 

Thr Ser Cys Ser Ser Asn val Ser Val Ala Leu Gly Pro Gin Gly 

2810 2815 2820 

Arj Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro He Ala 

2825 2880 2885 

Arg Ala Aia Trp Glu Thr Val Arg His Ser Pro Val Aso Ser Trp 

2840 2845 2850 

Leu Gly Asn lie He Gin Tyr Ala Pro Thr ile Trp Ala Arg Het 

2855 2880 2865 

Val Leu Het Thr His Phe Phe Ser He Leu Het Ala Gin Asp Thr 

2870 2875 2880 

Let) Asp Gin Asn Leu Asn Phe Glu Het Tyr Gly Ala Val Tyr Ser 

2885 2890 2895 

val ser Pro Leu Asp Leu pro Ala lie lie Glu Arg Leu His Gly 

2900 2905 2910 

Leu Asp Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr 

2915 2920 2925 

Arg Val Ala Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg 

2930 2985 2940 

Ala Trp Lys Ser Arg Ala Arg Aia Val Arg Ala Ser Leu lie Ser 

2945 2950 2955 

Arg Gly Gly Arg Ala Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp 

2960 2965 2970 
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Ala Vat Lys Thr Lys Leu Lys leu Tfir Pro Leu Pro Glu Ala Arc 

29ft 298ft 2985 

Leu Leu Asp Leu Ser Ser Trp Plie Thr Val Gly Ala Gly Gly Gly 

2990 2995 3000 

Asp He Tyr His Ser Val Ser Ars Ala Are Pro Arg Leu Lett Leu 

3005 3010 3015 

Leu Gly Leu Leu Leu Leu W» Val Gly Val Gly Leu Pte Leu Leu 

3020 3025 3030 

Pro Ala Arg 
3033 
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Vj t ISSN'S) (DRN A0-T/l<X*7V A£D?P— Mb 
l=jfcgjL.fc£25^ L. z*iSSI=HCVtt*<DailSI6S 50 



MJgLfe. ££, HCVft#5l]3££li, i8JftuflL;S0>X9 
BIFj«ta)BiattAt«A*t-»r'ttS«i&*l«. Lfr 

u *+y7 , -¥>ettgn8Wiiffl*r*«i7o»6ic-r?-r, 

tiTHS. LfcAt^-C. *-fn>tt«>«H3BB5Bt«toT 
tNAN Bgf jK^OjStWl*flfjKt LT***ih.-t Lifc 

lv NANBifffta)ea[t^fiT»*y, ^^sssfei^j;^ 

[0004] 

-SciAvC,, B*li33l+-SNANBIfFi&0-f^X 

*v aa** -< p ^n<nmw Lfc h c v y / At i±&r l 
t-a-e-r. ^wit^fc-gitss^hfe. l 

fcAtoT. SSffil-teL^r^-blcS+LfcN ANB 

tt. if^-f^&llcNANBBT^^^^XffljifK^uK 

Tl»fc„ *fl«(D@6«ll*, NANBJffjfe'WAX^f/A 

*l^t»TN A N Bff&^AXHSJfiliU fitwroftffi 
mOffi <£ $ m %> Z. 1 15* § „ 



zim 
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£ 2 eHttftifffifil (SS?3-1 9 1 376f) 46 



[8IH£S*^£fc «>©¥&] ****6I±. nanb tfl=6£2 0Bttftmh8l=fc^T*<n-ffl£IMSL 

STi^^'f ;u^<DJ&*t. ^K. &2i::*or*&7K'J5^u fc„ *&E#6l;fc. £ 6l=ffift£iiifci. ^@hc-J6 

K$sm+t=»i=«ft»^£'3-j(+, -f-fflg*, t izour, afci=*i»sssa*3- K-r sawn c d 

h**tf^:"^y-*+'J7<0Jll;*J:yNANBF* NA^P-^ffl*i|t#l!aL. ZftdJ^T. HC-J 



fc 0 -troSS*$EBB#6li, NANBjff&O-f/L^-efc )=. 5' SSiSJ:y 3 4 OSffljl^U:*- KJ;y#3 5' # 
3HC-J6y;A<D&g^-£tt;<D»£gE5i|££±l=S 10 3-Kffl«S*L. -t*ilcttlvc#3ftSait£=i- 

ATf<DW36l=J:oT*«f!Br*aB6*ih.fc. *£W<DNA ^f-S. «*. 3' *«ffltl*> #fc&SaH£=i- h'f 

NBJffjfc-W^X (HC-J 64V A) l±, *-<P>tt 4««CtS^T 1 0 8«*(CU-7, hL-?^£*fc'1 5 

fflHCV^rV A£liJ5H*ro*<0T?*S. OSilO^a-h'fflKSSt*. 5' *S|»b3 4 1 

{0 0 0 6] *«8Bli. E9JfreiSCa>££EH££ *g<DTT->J:y*6SS=i-Kfi*l±9 0 9 9a*<D 

f-^^A^BSfF^-r/uxjie^igM-efcy. rn -*©«i^-^>'j-f^ >y7^-A*yfty. r 

A 5' m&£y3 4 0ffiOT2?U*^h*fr6£.&#::- *i^3 0 3 37S^K*3-KL-C^SCt6tMt>^t 

-Ki-^^ias^trgoggaws^u^-^K. 46t* ^«it^6:7^e^ji,*r=a«T?&6£#;L6*i.-ci* 
1=3' xmi oairou^h^f^tM sog<os^ 20 *S6W©n a NBiffiS^-f ^xftfiT-as- Kfll 

SB-Cfiy. 8Htf-«iBlft£*r*$gM#BS . ns-1 (gNtiftXa-1) , ns-2 (Bl- 

jff3fi^-r^x*jM*aeT-<D»WT?fe5 g sfc, 13ns 2) . ns-3 as- 3) , ns-4 tn— 4> , ns 

8 9, EOT»#3jE««)S*K5ll*#1-6cDNA^D [000 9) *ft«CDHC-J 6?J il*. 3-Qy/< 

-l/J 6-08 1. £$US«4gaaoS££jU$4r*i> f#i*aiJ8fS3 8 8, 232f <*-f P>tt) IC^SHfc 

oDNAi'n— >j 6 — 08, Z#t6#X*U**Kt HCvy/ii:H;8t4i, *9 KKSIK-aUT 

»ifW«ft**^-S#A*BSffF*'>'f l\>7,mm*>? ±ft®V&G7. 996<DtgHtt£, SfcT 5 / BEMI:: 

E«OT7'5/®ffi5i|S%r*JH , J^^5'-(' KP-303 -Kffi*lcfeL>T«i|E5i|-e9 4. 4«tt. 3 7*ffiJ*l=8 

3, SrEB&P, sK'J*^ K. cDNA>D- > L^T7 5/'®E5iJ-e9 0. 1 %T?*y . ZtL6©ffl«-p 

<n±ai!*t=fi-»$s±iBii&i=a»a*ji5ffl$-a-c»fc (±tt««ss^«i5itts*u-c^s. isstj, x>^o-^ 

^f'J^^-f K. fllTE^'J^^-f K(=*ft*i|«'J* S|I»«fcyT3S<DlISETf)iE. NS-1, NS-2. NS 

□ -+JKS(*:Sfcttt/^D-7-JUin;ft«)96liT?a5l». -3, NS-4, N s- SflD^^tifflTS/KE^lT? 

S 6f=. WE&tfM^S-r K£ffll*fc#A#BMK* 60. 4%_ 71. 1%, 57. 8%, 81. 1%, 7 

SWS. !BE#tS#*fflL»fcSfA*BaBF3ft»tilS<038 3. 1%, 6 9. 9%, 3' *fll#=t- K^fflSSE 

s-es^o 5"ii?2 9. 6%tL^-r*itffiL\ffi^tt£^-fiz-r^-f , 

[0 0 0 7] *ft«fflNANSlffift'7^X«g?-lt3fc SSS. H C - J -f P ^ttCDfgJI LfcH C Vttt 

<DA3fcLT#T. *<DS*EM£»3£Lfc. t l»J3<J: 40 ^<S&&Ct*^4fri:6ofc, 

tf*V/^y-]fiL!KJ:y»fcl*f*. HC-J1, HC- [0 0 1 0] *£IS#6HI=&£Lf- (Japan. 

J 4, H C- J 6$£ll*T*fflF3££iitf>fc„ HC-J J. Exp. Mod. , Vol. 60, 3. 167-1 

1. HC- J 6!iB*AttltL%&*T?, l»fjltHCV 7) HC — J 1 (7/ 'J AS) . HC-J 4 (B*S) 

tS(*Plttt*!l£*4ifcfttt:J:yfflfc. HC — J 4 UNA 1*t©7S /»E5IJ<DItKT?[*. =i78MS-ei±9 09661 

NBffFiS©SS*ttS«tffiLfc^>/i^^-AvbBfcS6ft; ±(BffiHt4SSl-fl)lt*tLT, x>*p-:/ffi«T*ii-e 
KSJlEf&tf, L^LCftliA-f P>|±OT#riBroHCV 9%. 53. 1 HtUl^WHtisp-ttZ-t 

tSflCTJIi (-) ?fc^fc„ CW3«©jti«giafl: < fcyRN #ft*»-3fc. **ffll=J=y, HC-J6W1 HC-J 

ASfflfiL, tL I ck ^> T R N AOTg&E?! Sffl'* 1«t, HC- J 4»©Lvf ftttSfcSatgfl)* * 
fc„ :«)HRNA(»5' *iS«!f<J>*tl 2 5 0 0, fcj:tf Z t tl-afc. 

3' 1 0 0fflS*E5<JI=O^ri*Btt*#4 SO [00 11] 



ffiHt^W £ A / i-f- -5 c d n a s 9 p - ^ ib L r *a> ± 

fflOia*E5iJ£a£L, $ 61=75 >»E5i|$*^L 



6lrVA03SifiRNA9 5 8 9Sa(0S?JroHflS£S 






fttt?**. *KBB<njK'J^^$-< N A N B SiffF^i 

*/*n-^-;HS#li. #A#BS!IFFjKttffi5&±: LTffl 
[0012] 10 

[0 0 13] 3S}6«1 
( 1 ) RNAtttM 

B*A«jfli#a)iiiiK*&»fc, HevttttKte (sh-v 

• VJTVSXTIV'J • WfAXtiS. t-VHC 
VAbEL I SA + 'vH <t2H^**ifc«S»:HC~J 
1. HC-J 6i3<i:tfN ANBfFUJOSiftSitaLfcf 1 
WO';-*6HfcHCVfi# (-) (DttjtHC-d 4 20 

*y. aro^^iiLrRNA^tttULfc,, jflusrzhyx 

J£»iS«r;£ (1 OmM, p H8. 0) ZHOU.. 6 8X1 
0 a r pm-C1B#raa-6Lf=. »b*lfcA*l/7t-l:2 0 
OmM NaC I . 1 OmM EDTA, 2% (w/ 
v) K-ri/^UaiS^-J-iJ^i* (3DS) . teJitflmg 

/ m I S)?Oft — "tfK I" "J XtSStjSflBJS (5 0 

mM. pH8. OJ £*0*, 6 Ot-C 1 ^BfliDiSL, I 

*y-M»SfH\ RNAgSfc. 
[0014] (2) HC-J1cDNA^ 

HC-J 1j4L»J:yiiHiLfeRNA£ 7 o*CTM #WiiD 30 

DNA Synthesis System Plus,

(2 O-me r) 20pmol £linx.-C, 4 2°C. 15 
SraSf&S-e-CcDNA^Wfco 

3-Py/«l$l¥tljfl5fiM3 1 8, 2 1 6?[:$JftfcH 
CVffi&E$i|£#S8LT£fi£Lfc#8 (5' -GATG 
CTTGCGGAACGCAATCA-3' ) SfflU 

[0015] (3) fK'Jj<-7— tf^i-f>'JT^v3> 40 



(PGR) KcfcSc DNAOTfUS 
Gene Amp O N A^WKIH* V h (/i— 
4,7— . >>— 9^^) SfflWzSa i k i £©£& (S 
cianc* 239. 487-49 1, 1 938) |: 
<MJ. DNA1t-7-<97- (A— +^1^7— ■ 
IZT3 ^Ufflc DNA#IS£fTofc„ 

[0016] (4) cDNA^* 7^'J-©#fgg[=J:5 
HC-J1. HC-J4ID5' $Wflli££E9I©i%£ 
oDNA^-f^'J-Sffll^HC-J 1^/^(95' 
*i«HM©iSeE5>J»*rl*B2 {HC-JD 03 (HC 
-J 4) IZTp-f&VlZ, cDNA^fUt77-v 
Jet 1 oi=»ALT»fc*n-^©»Wat;o DNA 
£ P C Rf=-C«l«LTI§6ih.fc-> P->©ffl*fSS©i5g 
#*W#T3feSLfc 0 @2fcJ:tf[g3l*NANB|ff&'7 

^T'J + ^7-e?A g t 1 Offl^^'J-WS^P 
->-e«JSEMS*SUfcS!fflS. jS«l±PCRt=J:S 
$D->T?**EOTt*£Lfcifill£*-*. HC-J 1 
©nt454~2109©165 6«£f±77-f 7— # 
8frb»fccDNA$^ g t 1 QV7-*}*99- (7 

n t 824~843©grUL^ : 5-f7— #25 C5' - 
TCCCTGTTGCATAGTTCACG— 3' ) £ 
ffll*T. HC-J4©oDNA7'f77 l HA^I*4 
Off) c DNA9Q — >^ 60, 061, 066. 075 
£$T. dit,©=i>fe>-9-X ->-^x>XA^b±g!l 
©n t 1 8-8 4 3©fi«E5l)S»«>fc. 
[00 17] (5) HC-J 6. 5' *9S<HiS4SE5l|© 

HC-je^rViros' *««i©£SE5*iiii8 4 

•fc^fZ. cDNA£PCRKT«l®L-C£€>;hfc?P- 

X. fflfflLfcRNAfrblffliS (2) ©TJjSlzftoT. c 
DNAJttffiL. tffly-Jl^iSSLfc. nt2 
4-255 1 r*HC~J4©ffi*E5l|**l=Lfc^ ; 5-f 
-?-©*§.?>£:b-ti-)-J:& P c RI=T:fS&!h.fc#$ 
©□>-tr>-tf-x ?x:>xfr fife. 



nt 24-826

#32 (5'-ACTCCACCATAGATCACTCC-3')
#122 (5'-AGGTTCCCTGTTGCATAATT-3')

Clones: C9397, C9388, C9764

nt 732-1907

#50 (5'-GCCGACCTCATGGGGTACAT-3')
#128 (5'-TCGGTCGTGCCCACTACCAC-3')

Clones: C9316, C9752, C9753

nt 1867-2591

#149 (5'-TCTGTGTGTGGCCCAGTGTA-3')
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#146 (5' -AGTAGCAT 
?P~^ ; C1 1 621, C1 1 

2 6 5©71' i f-tr>X^'5'f "7— #3 6 (5' -AAC 
ACTACTCGGCTAGCAGT-3' ) £ffll^"C 
c DNA££jSLfcff>*.. S-ST/Ut 1 *^*?^ 

dAftJlP£tT(,\ 2S?S<Do n o- s i d e d PCRi* 

«£frofc„ -fisfo*,. HHSli^'JJdT^-fv- 

(2 0-me r) int 1 8 8 - 2 0 7 07^* >X 
7"5'f7-#4 8 (5' -GT TGATCCAAGAA 
AGGACCC-3' ) JfflHt3 5^-f7MPCR 

T^7-C"7— (2 0 -m er) int140~16 
0«r>ft>^^-f7-# 10 9 (2 1 -ma r : 
5' — ACCGGATCCGCAGACCACTAT— 

3' > Sffli>T3 o-7--i'^^aipcRSff^fc <1 se.ti 

1 3fH<D&jtLfc9 P— >. C9 5 7 7, C9579, 
C9581, C9 5 8 4, C9587. C9 5 9 0, C 
9591, C9695, C9 6 0 6. C9609. C9 
615, C9616. C961 9 £f# , ■£*l£ > 0!)zi >-tz 
>+J"X *X>7.^e,n t 1 ~2 3<DigSffiSI]£» 

(6) HC-J6, **«i*l£SS?J<D*S 
HC- J 6j!l8l 0 0m I £1)5*41::, (2) ©7jj*l=* 
1ST A E t 1 0/<<7t'J^|-7 7— yiCq DNA7Y ?7 

y-£«3feLfc., ^-fv-iLt^ 

41IB3 1 8, 2 1 6-^[-^$tLfcHCVlS«IB5t)^#BB 

L."C^fiELfc?K l -Pji^U^-^K# 1 6Zfcii;#8 i £ 

^i^yfT^fc. anc^j:-5ic, $e>*i.fc4ra© c 

DNA5P- ^0 2 (nt6996~8700) , 06 
(nt6485-8700) , 08 (n! 6008-8 
700) , 081 (nt2199~6168) £8S#r 
U fSibflfc^-tZ^X y-?lV^6nt25 
5 2-8 7 0 OfflSSE$l|£*£Lfc,> Cffi3*>, ->P 
->*8 1. 0 8lt-en-p*l.iE?IIS-f-3, 4OT&SE5U 

[0 0 18] (7) H C — J 6 tfj i»(D 3 ' *SWM<Z>£ 
*BB5i]l±ia5lcsrJt;-5l~, PCRKTiitlLT^ft 
kVa-iswMmz&xv&tMLtz, Hc-jeont 
8 7 o i ~9 2 4 i a)ftM\zr>^x\£ffik%:-}U-*— 

#80 (5' -GACACCCGCTGTTTTGAC 
TC-3' ) fc,fctf#60 (5' -GTTCTTACT 
GCCCAGTTGAA — 3' ) SfflL^fcP C RlZfr 
If, »e>JKfc 9 3 84S(D^n- ^C9 7 60. C9 2 
3 4, C9 7 6 1©SSiE?l|J:y^t,h*P>-b^-«-X 
?x>x^t.S3eLfco n i 9 24 2J;yT3E(0 



CATCCACAAGCA-3' ) 
6 2 4, C 1 1 6 5 5 

ttt>*>. y (D s«0saicffio-cRNA^ 

ttHJL. *U (A) 5— tf£ffll*TRNA(D3' 
^SSIzjH'J (A) £fMl]L, +^3" (dT) 20 ST*^ 
-f7-t UT c DNA£fift£ffl,\ #£*lfccDNA£ 
f>7"l/-hi:LTPCRi;«Lfe„ PCRliSeiKPS 
t Lr-tzi/T.^^fV-t Ur#9 7 (5 -AGTC 
10 AGGGCG T CCC T C A T CT-3' ) £. 71^ 
-fe^X^ ; 5-f7- ( t LT^-'Jzf (d T) zo Sffl^rtr 
-3fc. #1-, ^?>*1.fccDNA^SK2StePCRtL7: 

m i spsj; yTsfii=ffla-r«-b>x^7'f7~#9 o 

(5' — GCCGTTTGCGGCCGATATCT— 
3' ) £, T^^-b^X^^-fV-tLr^-'J^ (d 
T) jo ^fflurfrofco 2SPgflJPCR7?#e.tlfc«» 
tS£ttl= T4DNA*iJ 5— tffcftffl Sti-ffijlS* spjftIC 
Lfcjfc. T4#'ja-7Uj-*K*7"-- bfl-T5' *S5£ 

MVKfcL, M 1 3 m p 1 977-' *J*t>2— OiH i n 
20 c I iaifel^^7R-->^LT:«SE?l£*SU 
fz 0 uoLT^^fclfli^P-^ C103 1 
1, C 1 O 3 1 3. C 1 0 3 1 4. C10320, CI 
0322, CI 0323, CI 0 3 2 6, C1032 
8, C10330. C10333, C10334, CI 
0336. C10337, C10345, C1034 
6. C10347, C10349. C10350, CI 
0 3 5 7&P>ftt,tL&=il'-k>-y-X >x>XJ:y 
3' *JM0MMHBM£*«Lfe. JELhOflUHMtJ^J 
HC-J6^ Alzft^-T^c DNA«SE?i]£K5>J# 
30 -5|-2fae<»J: SKftSSU ttyMffl^fiRNAfi 
«K*iJ£E*i|## 1 ffittro* 5 IcSfeS Lfc„ 
too 19] C8) HC-J6, 75/aBB?Jfl)3SS 
HC-J 6 7VAOTRN ASSESS*, irK, Mfep h' 
>ATGi y *S*Sp- Kfg*a>7 5 / ®EM£E5ilS 
#5QJ,i:7l-3tSLfc. HC-J6tt3 0 3 

3 7 5 7* -f KSHH*£p-K 

-fa i wgut-^'j-fo^u-Ajrya^, 

(0 02 0] 

40 [S«ro»*] *f8WI*. *A*BSBFSfe7-f*XHC 

-je^ai, /uxit£^©*ig£E5ij£$ 

-f*X^S®S)^tf]^5Ct^-e^. *fc, Z*UC!& 

[O 02 1 ] 

[Sequence Listing]
SEQ ID NO: 1
Sequence length: 9589

Sequence type: nucleic acid



1 



Sequence kind: genomic RNA

Sequence

ACCCGCCCCU AAUAGGGGCG ACACUCCGCC AUGAACCACU CCCCUGUGAG GAACUACUGU 60
CUUCACGCAG AAAGCGUCUA GCCAUGGCGU UAGUAUGAGU GUCGUACAGC CUCCAGGCCC 120
CCCCCUCCCG GGAGAGCCAU AGUGGUCUGC GGAACCGGUG AGUACACCGG AAUUGCCGGG 180
AAGACUGGGU CCUUUCUUGG AUAAACCCAC UCUAUGCCCG GUCAUUUGGG CGUGCCCCCG 240
CAAGACUGCU AGCCGAGUAG CGUUGGGUUG CGAAAGGCCU UGUGGUACUG CCUGAUAGGG 300
UGCUUGCGAG UGCCCCGGGA GGUCUCGUAG ACCGUGCACC AUGAGCACAA AUCCUAAACC 360
UCAAAGAAAA ACCAAAAGAA ACACCAACCG UCGCCCACAA GACGUUAAGU UUCCGGGCGG 420
CGGCCAGAUC GUUGGCGGAG UAUACUUGUU GCCGCGCAGG GGCCCCAGGU UGGGUGUGCG 480
CGCGACAAGG AAGACUUCGG AGCGGUCCCA GCCACGUGGA AGGCGCCAGC CCAUCCCUAA 540
GGAUCGGCGC UCCACUGGCA AAUCCUGGGG AAAACCAGGA UACCCCUGGC CCCUAUACGG 600
GAAUGAGGGA CUCGGCUGGG CAGGAUGGCU CCUGUCCCCC CGAGGUUCCC GUCCCUCUUG 660
GGGCCCCAAU GACCCCCGGC AUAGGUCCCG CAACGUGGGU AAGGUCAUCG AUACCCUAAC 720
GUGCGGCUUU GCCGACCUCA UGGGGUACAU CCCUGUCGUA GGCGCCCCGC UCGGCGGCGU 780
CGCCAGAGCU CUCGCGCAUG GCGUGAGAGU CCUGGAGGAC GGGGUUAAUU UUGCAACAGG 840
GAACUUACCC GGUUGCUCCU UUUCUAUCUU CUUGCUGGCC CUGCUGUCCU GCAUCACCAC 900
CCCGGUCUCC GCUGCCGAAG UGAAGAACAU CAGUACCGGC UACAUGGUGA CCAACGACUG 960
CACCAAUGAU AGCAUUACCU GGCAACUCCA GGCUGCUGUC CUCCACGUCC CCGGGUGCGU 1020
CCCGUGCGAG AAAGUGGGGA AUACAUCUCG GUGCUGGAUA CCGGUCUCAC CGAAUGUGGC 1080
CGUGCAGCAG CCCGGCGCCC UCACGCAGGG CUUACGGACG CACAUUGACA UGGUUGUGAU 1140
GUCCGCCACG CUCUGCUCCG CUCUUUACGU GGGGGACCUC UGCGGUGGGG UGAUGCUUGC 1200
AGCCCAGAUG UUCAUUGUCU CGCCACAGCA CCACUGGUUU GUGCAAGACU GCAAUUGCUC 1260
CAUCUACCCU GGUACCAUCA CUGGACACCG CAUGGCGUGG GACAUGAUGA UGAACUGGUC 1320
GCCCACGGCU ACCAUGAUCC UGGCGUACGC GAUGCGCGUC CCCGAGGUCA UCAUAGACAU 1380
CAUUGGCGGG GCUCAUUGGG GCGUCAUGUU CGGCUUAGCC UACUUCUCUA UGCAGGGAGC 1440
GUGGGCAAAA GUCGUUGUCA UUCUUUUGCU GGCCGCCGGG GUGGACGCGC AAACCCAUAC 1500
CGUUGGGGGU UCUACCGCCC AUAACGCCAG GACCCUCACC GGCAUGUUCU CCCUUGGUGC 1560
CAGGCAGAAA AUCCAGCUCA UCAACACCAA UGGCAGUUGG CACAUCAACC GCACCGCCCU 1620
GAACUGCAAU GACUCUUUGC ACACCGGCUU CCUCGCGUCA CUGUUCUACA CCCACAGCUU 1680
CAACUCGUCA GGAUGUCCCG AACGCAUGUC CGCCUGCCGC AGUAUCGAGG CCUUUCGGGU 1740
GGGAUGGGGC GCCUUACAAU AUGAGGACAA UGUCACCAAU CCAGAGGAUA UGAGACCGUA 1800
UUGCUGGCAC UACCCACCAA GACAGUGUGG UGUAGUCUCC GCGAGCUCUG UGUGUGGCCC 1860
AGUGUACUGU UUCACCCCCA GCCCAGUAGU AGUGGGUACG ACCGAUAGAC UUGGAGCGCC 1920
CACUUACACG UGGGGGGAGA AUGAGACAGA UGUCUUCCUA UUGAACAGCA CUCGACCACC 1980
GCAGGGGUCA UGGUUCGGCU GCACGUGGAU GAACUCCACU GGCUACACCA AGACUUGCGG 2040
CGCACCACCC UGCCGCAUUA GAGCUGACUU CAAUGCCAGC AUGGACUUGU UGUGCCCCAC 2100
GGACUGUUUU AGGAAGCAUC CUGAUACCAC CUACAUCAAA UGUGGCUCUG GGCCCUGGCU 2160
CACGCCAAGG UGCCUGAUCG ACUACCCCUA CAGGCUCUGG CAUUACCCCU GCACAGUUAA 2220
CUAUACCAUC UUCAAAAUAA GGAUGUAUGU GGGGGGGGUC GAGCACAGGC UCACGGCUGC 2280
GUGCAAUUUC ACUCGUGGGG AUCGUUGCAA CUUGGAGGAC AGAGACAGAA GUCAACUGUC 2340
UCCUUUGCUG CACUCCACCA CGGAGUGGGC CAUUUUACCU UGCACUUACU CGGACCUGCC 2400
CGCCUUGUCG ACUGGUCUUC UCCACCUCCA CCAAAACAUC GUGGACGUGC AAUUCAUGUA 2460
UGGCCUAUCA CCUGCUCUCA CAAAAUACAU CGUCCGAUGG GAGUGGGUAG UACUCUUAUU 2520
CCUGCUCUUA GCGGACGCCA GGGUUUGCGC CUGCUUAUGG AUGCUCAUCU UGUUGGGCCA 2580
GGCCGAAGCA GCACUAGAGA AGUUGGUCGU CUUGCACGCU GCGAGCGCAG CUAGCUGCAA 2640
UGGCUUCCUA UACUUUGUCA UCUUUUUCGU GGCUGCUUGG UACAUCAAGG GUCGGGUAGU 2700
CCCCUUGGCU ACUUAUUCCC UCACUGGCCU AUGGUCCUUU GGCCUACUGC UCCUAGCAUU 2760
GCCCCAACAG GCUUAUGCUU AUGACGCAUC UGUACAUGGU CAGAUAGGAG CAGCUCUGUU 2820
GGUACUGAUC ACUCUCUUUA CACUCACCCC CGGGUAUAAG ACCCUUCUCA GCCGGUUUCU 2880
GUGGUGGUUG UGCUAUCUUC UGACCCUGGC GGAAGCUAUG GUCCAGGAGU GGGCACCACC 2940
UAUGCAGGUG CGCGGUGGCC GUGAUGGGAU CAUAUGGGCC GUCGCCAUAU UCUGCCCGGG 3000
UGUGGUGUUU GACAUAACCA AGUGGCUCUU GGCGGUGCUU GGGCCUGCUU AUCUCCUAAA 3060
AGGUGCUUUG ACGCGUGUGC CGUACUUCGU CAGGGCUCAC GCUCUACUAA GGAUGUGCAC 3120
CAUGGUAAGG CAUCUCGCGG GGGGUAGGUA CGUCCAGAUG GUGCUACUAG CCCUUGGCAG 3180
GUGGACUGGC ACUUACAUCU AUGACCACCU CACCCCUAUG UCGGAUUGGG CUGCUAAUGG 3240
CCUGCGGGAC UUGGCGGUCG CCGUGGAGCC UAUCAUCUUC AGUCCGAUGG AGAAAAAAGU 3300
CAUCGUCUGG GGAGCGGAGA CAGCUGCUUG CGGGGAUAUC UUACACGGAC UUCCCGUGUC 3360
CGCCCGACUU GGCCGGGAGG UCCUCCUUGG CCCAGCUGAU GGCUAUACCU CCAAGGGGUG 3420
GAGUCUUCUC GCCCCCAUCA CUGCUUAUGC CCAGCAGACA CGCGGCCUUU UGGGCACCAU 3480
AGUGGUGAGC AUGACGGGGC GCGACAAGAC AGAACAGGCC GGGGAGAUUC AGGUCCUGUC 3540
CACGGUCACU CAGUCCUUCC UCGGAACAAC CAUCUCGGGG GUCUUAUGGA CUGUCUACCA 3600
UGGAGCUGGC AACAAGACUC UAGCCGGCUC ACGGGGUCCG GUCACACAGA UGUACUCCAG 3660
UGCUGAGGGG GACUUAGUGG GGUGGCCCAG CCCCCCCGGG ACCAAAUCUU UGGAGCCGUG 3720
CACGUGUGGA GCGGUCGACC UAUACCUGGU CACGCGAAAC GCUGAUGUCA UCCCGGCUCG 3780
AAGACGCGGG GACAAGCGAG GAGCGCUACU CUCCCCGAGA CCUCUUUCCA CCUUGAAGGG 3840
GUCCUCGGGG GGCCCGGUGC UCUGCCCCAG AGGCCACGCU GUCGGGGUCU UCCGGGCAGC 3900
CGUGUGCUCC CGGGGCGUGG CCAAGUCCAU AGAUUUUAUC CCCGUUGAGA CACUUGACAU 3960
CGUCACUCGG UCCCCCACCU UUAGUGACAA CAGCACACCA CCUGCUGUGC CCCAAACUUA 4020
UCAGGUCGGG UACUUACAUG CCCCGACUGG UAGUGGAAAG AGCACCAAAG UCCCUGUCGC 4080
GUAUGCCGCU CAGGGGUACA AAGUGCUAGU GCUUAAUCCC UCGGUGGCUG CCACCCUGGG 4140
GUUUGGGGCG UACUUGUCCA AGGCACAUGG CAUCAAUCCC AACAUUAGGA CUGGGGUCAG 4200
GACUGUGACG ACCGGGGCGC CCAUCACGUA CUCCACAUAU GGCAAAUUCC UCGCCGAUGG 4260
GGGCUGCGCA GGCGGCGCCU AUGACAUCAU CAUAUGCGAU GAAUGCCAUG CCGUGGACUC 4320
UACCACCAUU CUCGGCAUCG GAACAGUCCU CGAUCAAGCA GAGACAGCCG GGGUCAGGCU 4380
AACUGUACUG GCUACGGCUA CGCCCCCCGG GUCAGUGACA ACCCCCCACC CCAACAUAGA 4440
GGAGGUGGCC CUCGGGCAGG AGGGUGAGAU CCCCUUCUAU GGGAGGGCGA UUCCCCUGUC 4500
AUACAUCAAG GGAGGAAGAC ACUUGAUCUU CUGCCACUCA AAGAAAAAGU GUGACGAGCU 4560






CGCGGCGGCC CUUCGGGGUA UGGGCUUGAA CGCAGUGGCA UACUACAGAG GGCUGGACGU 4620
CUCCGUAAUA CCAACUCAGG GAGACGUAGU GGUCGUCGCC ACCGACGCCC UCAUGACGGG 4680
GUUUACUGGA GACUUUGACU CCGUGAUCGA CUGCAACGUA GCGGUCACUC AAGUUGUAGA 4740
CUUCAGCUUG GACCCCACAU UCACCAUAAC CACACAGACU GUCCCUCAAG ACGCUGUCUC 4800
ACGUAGCCAG CGCCGGGGCC GCACGGGCAG GGGAAGACUG GGUAUUUAUA GGUAUGUUUC 4860
CACUGGUGAG CGAGCCUCAG GAAUGUUUGA CAGUGUAGUG CUCUGCGAGU GCUACGAUGC 4920
AGGGGCCGCA UGGUAUGAGC UCACACCAGC GGAGACCACC GUCAGGCUCA GAGCAUAUUU 4980
CAACACACCU GGUUUGCCUG UGUGCCAAGA CCAUCUUGAG UUUUGGGAGG CAGUUUUCAC 5040
CGGCCUCACA CACAUAGAUG CCCACUUCCU UUCCCAAACA AAGCAAUCGG GGGAAAAUUU 5100
CGCAUACUUA ACAGCCUACC AGGCUACAGU GUGCGCUAGG GCCAAAGCCC CCCCCCCGUC 5160
CUGGGACGUC AUGUGGAAGU GUUUGACUCG ACUCAAGCCC ACACUCGUGG GCCCCACACC 5220
UCUCCUGUAC CGCUUGGGCU CUGUUACCAA CGAGGUCACC CUCACGCAUC CUGUGACGAA 5280
AUACAUCGCC ACCUGCAUGC AAGCCGACCU UGAGGUCAUG ACCAGCACGU GGGUCUUAGC 5340
UGGGGGGGUC UUGGCGGCCG UCGCCGCCUA CUGCCUGGCG ACCGGGUGUG UUUGCAUCAU 5400
CGGCCGCUUG CACGUUAACC AGCGAGCCGU CGUUGCACCG GACAAGGAGG UCCUCUAUGA 5460
GGCUUUUGAU GAGAUGGAGG AAUGUGCCUC UAGAGCGGCU CUCAUUGAAG AGGGGCAGCG 5520
GAUAGCCGAG AUGCUGAAGU CCAAGAUCCA AGGCUUAUUG CAGCAAGCUU CCAAACAAGC 5580
UCAAGACAUA CAACCCGCUG UGCAGGCUUC UUGGCCCAAG GUAGAGCAAU UCUGGGCCAA 5640
ACACAUGUGG AACUUCAUCA GCGGCAUUCA AUACCUCGCA GGACUAUCAA CACUGCCAGG 5700
GAACCCUGCU GUAGCUUCCA UGAUGGCAUU CAGUGCCGCC CUCACCAGUC CGUUGUCAAC 5760
UAGCACCACU AUCCUUCUCA ACAUUUUGGG GGGCUGGCUA GCAUCCCAAA UUGCGCCUCC 5820
CGCGGGGGCU ACCGGCUUCG UCGUCAGUGG CCUGGUGGGG GCUGCCGUAG GCAGCAUAGG 5880
CUUGGGUAAG GUGCUGGUGG ACAUCCUGGC AGGGUAUGGU GCGGGCAUUU CGGGGGCUCU 5940
CGUCGCAUUC AAGAUCAUGU CUGGCGAGAA GCCCUCCAUG GAGGAUGUUG UCAACCUGCU 6000
GCCUGGAAUU CUGUCUCCGG GUGCCCUGGU GGUGGGAGUC AUCUGCGCGG CCAUCCUGCG 6060
CCGACACGUG GGACCGGGGG AAGGCGCUGU CCAAUGGAUG AAUAGGCUCA UUGCCUUUGC 6120
UUCCAGAGGA AACCACGUCG CCCCCACCCA CUACGUGACG GAGUCGGAUG CGUCGCAGCG 6180
UGUGACCCAA CUACUUGGCU CCCUUACCAU AACCAGCCUG CUCAGGAGAC UCCACAACUG 6240
GAUUACUGAA GACUGCCCGA UCCCAUGCAG CGGCUCGUGG CUCCGCGAUG UGUGGGAUUG 6300
GGUUUGCACC AUCCUAACAG ACUUUAAAAA CUGGCUGACC UCCAAAUUGU UCCCAAAGAU 6360
GCCUGGUCUC CCCUUUAUCU CUUGUCAAAA GGGGUACAAG GGCGUGUGGG CUGGCACUGG 6420
UAUCAUGACC ACACGGUGUC CUUGCGGCGC CAAUAUCUCU GGCAAUGUCC GCCUGGGCUC 6480
CAUGAGAAUU ACGGGGCCCA AAACCUGCAU GAAUAUCUGG CAGGGGACCU UUCCCAUCAA 6540
UUGUUACACG GAGGGCCAGU GCGUGCCGAA ACCCGCACCA AACUUUAAGA UCGCCAUCUG 6600
GAGGGUGGCG GCCUCAGAGU ACGCGGAGGU GACGCAGCAC GGGUCAUACC ACUACAUAAC 6660
AGGACUUACC ACUGAUAACU UGAAAGUUCC UUGCCAACUA CCUUCUCCAG AGUUCUUUUC 6720
CUGGGUGGAC GGAGUGCAGA UCCAUAGGUU UGCCCCCAUA CCGAAGCCGU UUUUUCGGGA 6780
UGAGGUCUCG UUCUGCGUUG GGCUUAAUUC AUUUGUCGUC GGGUCUCAGC UCCCUUGCGA 6840
UCCUGAACCU GACACAGACG UAUUGACGUC CAUGCUAACA GACCCAUCCC AUAUCACGGC 6900
GGAGACUGCA GCGCGGCGUU UGGCACGGGG GUCACCCCCG UCCGAGGCAA GCUCCUCAGC 6960
GAGCCAGCUA UCGGCACCAU CGCUGCGAGC CACCUGCACC ACCCACGGCA AGGCCUAUGA 7020
UGUGGACAUG GUGGAUGCCA ACCUGUUCAU GGGGGGCGAU GUGACCCGGA UAGAGUCUGA 7080
GUCCAAAGUG GUCGUUCUGG ACUCUCUCGA CCCAAUGGUC GAAGAAAGGA GCGACCUUGA 7140
GCCUUCGAUA CCAUCGGAAU AUAUGCUCCC CAAGAAGAGA UUCCCACCAG CCUUACCGGC 7200
UUGGGCACGG CCUGAUUACA ACCCACCGCU UGUGGAAUCG UGGAAGAGGC CAGAUUACCA 7260
ACCGGCCACU GUUGCGGGCU GCGCUCUCCC CCCCCCUAAG AAAACCCCGA CGCCUCCCCC 7320
AAGGAGACGC CGGACAGUGG GUCUGAGUGA GAGCUCCAUA GCAGAUGCCC UACAACAGCU 7380
GGCCAUCAAG UCCUUUGGCC AGCCCCCCCC AAGCGGCGAU UCAGGCCUUU CCACGGGGGC 7440
GGACGCAGCC GAUUCCGGCA GUCGGACGCC CCCCGAUGAG UUGGCCCUUU CGGAGACAGG 7500
UUCCAUCUCC UCCAUGCCCC CUCUCGAGGG GGAGCCUGGA GAUCCAGACU UGGAGCCUGA 7560
GCAGGUAGAG CUUCAACCUC CCCCCCAGGG GGGGGUGGUA ACCCCCGGCU CAGGCUCGGG 7620
GUCUUGGUCU ACUUGCUCCG AGGAGGACGA CUCCGUCGUG UGCUGCUCCA UGUCAUACUC 7680
CUGGACCGGG GCUCUAAUAA CUCCUUGUAG CCCCGAAGAG GAAAAGUUGC CAAUUGGCCC 7740
CUUGAGCAAC UCCCUGUUGC GAUAUCACAA CAAGGUGUAC UGUACCACAU CAAAGAGCGC 7800
CUCAUUAAGG GCUAAAAAGG UAACUUUUGA UAGGAUGCAA GCGCUCGACG CUCAUUAUGA 7860
CUCAGUCUUG AAGGACAUUA AGCUAGCGGC CUCCAAGGUC ACCGCAAGGC UUCUCACUUU 7920
AGAGGAGGCC UGCCAGUUAA CUCCACCCCA CUCUGCAAGA UCCAAGUAUG GGUUUGGGGC 7980
UAAGGAGGUC CGCAGCUUGU CCGGGAGAGC CGUUAACCAC AUCAAGUCCG UGUGGAAGGA 8040
CCUCCUGGAA GACACACAAA CACCAAUUCC UACAACCAUC AUGGCCAAAA AUGAGGUGUU 8100
CUGCGUGGAC CCCACCAAGG GGGGUAAGAA AGCAGCUCGC CUUAUCGUUU ACCCUGACCU 8160
CGGCGUCAGG GUCUGCGAGA AAAUGGCCCU UUAUGAUAUC ACACAAAAGC UUCCUCAGGC 8220
GGUGAUGGGG GCUUCUUAUG GAUUCCAGUA CUCCCCCGCU CAGCGGGUGG AGUUUCUCUU 8280
GAAGGCAUGG GCGGAAAAGA AAGACCCUAU GGGUUUUUCG UAUGAUACCC GAUGCUUUGA 8340
CUCAACCGUC ACUGAGAGAG ACAUCAGGAC UGAGGAGUCC AUAUAUCGGG CUUGUUCCUU 8400
GCCCGAGGAG GCCCACACUG CCAUACACUC ACUGACUGAG AGACUUUACG UGGGAGGGCC 8460
CAUGUUCAAC AGCAAGGGCC AGACCUGCGG GUACAGGCGU UGCCGCGCCA GCGGGGUGCU 8520
UACCACUAGC AUGGGGAACA CCAUCACAUG CUAUGUGAAA GCCUUAGCGG CCUGUAAGGC 8580
UGCAGGGAUA AUUGCGCCCA CAAUGCUGGU AUGCGGCGAU GACUUGGUUG UCAUCUCAGA 8640
GAGCCAGGGG ACCGAGGAGG ACGAGCGGAA CCUGAGAGCC UUCACGGAGG CUAUGACCAG 8700
GUAUUCUGCC CCUCCUGGUG ACCCCCCCAG ACCGGAAUAU GACCUGGAGC UGAUAACAUC 8760
UUGCUCCUCA AAUGUGUCUG UGGCGUUGGG CCCACAAGGC CGCCGCAGAU ACUACCUGAC 8820
CAGAGACCCU ACCACUCCAA UCGCCCGGGC UGCCUGGGAA ACAGUUAGAC ACUCCCCUGU 8880
CAAUUCAUGG CUAGGAAACA UCAUCCAGUA CGCCCCAACC AUAUGGGCUC GCAUGGUCCU 8940
GAUGACACAC UUCUUCUCCA UUCUCAUGGC CCAAGAUACU CUGGACCAGA ACCUCAACUU 9000
UGAGAUGUAC GGAGCGGUGU ACUCCGUGAG UCCCUUGGAC CUCCCAGCCA UAAUUGAAAG 9060
GUUACACGGG CUUGACGCUU UCUCUCUGCA CACAUACACU CCCCACGAAC UGACACGGGU 9120
GGCUUCAGCC CUCAGAAAAC UUGGGGCGCC ACCCCUCAGA GCGUGGAAGA GCCGGGCACG 9180
UGCAGUCAGG GCGUCCCUCA UCUCCCGUGG GGGGAGAGCG GCCGUUUGCG GCCGAUAUCU 9240
CUUCAACUGG GCGGUGAAGA CCAAGCUCAA ACUCACUCCA UUGCCGGAAG CGCGCCUCCU 9300
GGAUUUAUCC AGCUGGUUCA CUGUCGGCGC CGGCGGGGGC GACAUUUAUC ACAGCGUGUC 9360
GCGUGCCCGA CCCCGCUUAU UACUCCUUGG CCUACUCCUA CUUUUUGUAG GGGUAGGCCU 9420
UUUCCUACUC CCCGCUCGGU AGAGCGGCAC ACAUUAGCUA CACUCCAUAG CUAACUGUCC 9480
CUUUUUUUUU UUUUUUUUUU UUUUUUUUUU UUUUUUUUUU UUUUUUUUUU UUUUUUUUUU 9540
UUUUUUUUUU UUUUUUUUUU UUUUUUUUUU UUUUUUUUUU UUUUUUUUU 9589

[0022] SEQ ID NO: 2
Sequence length: 9589
Number of strands: single-stranded
Sequence kind: cDNA to genomic RNA

Sequence

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ACCCGCCCCT AATAGGGGCG ACACTCCGCC ATGAACCACT CCCCTGTGAG GAACTACTGF 60 
CTTCACGCAG AAAGCGTCTA GCCATGGCGf TAflTATGAGT GTCGTACAGC CTCCAGGCCC 120 
CCCCCTCCCG G6AGA6CCAT AGTG6TCTGC GGAACCGGTG AGFACACCGG AATTGCCGGG 180 
AA6ACTGGGT CCTTTCTTGG ATAAACCCAC TCTATGCCCG GTCATTTGGG CGTGCCCCCG 240 
CAAGACTGCT AGCCGAGTAG CGTTGGGTTS CGAAAGGCCT TGTGGTACTG CCTGATAG6G 300 
TGCTTGC6AG TGCCCCGGGA GG7CTCGTAG ACCGTGCACC ATGAGCACAA ATCCTAAACC 360 
TCAAAGAAAA ACCAAAAGAA ACACCAACCG TCGCCCACAA GACGTTAAGT TTCCGGGCGG 420 
CGGCCAGAFC GTTGGCGGAG TATACTTGTT GCCGCGCAGG GGCCCCAGGT TGGGTGTGC6 480 
CGCGACAAGG AAGACTTCGG AGCGGTCCCA GCCACGTGGA AGGCGCCAGC CCATCCC7AA 540 
GGAIC6GCGC TCCACTGGCA AATCCTGGGG AAAACCAGGA TACCCCTGGC CCCTAfACGG 600 
GAATGAGGGA CTCGGCTG6G CAGGATGGCT CCTGTCCCCC CGAGGTTCCC GTGCCTCTTG 660 
GG6CCCCAAT GACCCCCGGC ATAGGTCCCG CAACGTGGGT AA6GTCATCG ATACCCTAAC 720 
GTGCGGCTTT GCCGACCTCA TGGGGTACAT CCCTGTCGTA GGCGCCCCGC TCGGCGGCGT 780 
CGCCAGAGCT CTCGCGCATG GCGTGAGAGT CCTGGAGGAC GGGGT7AATT TTGCAACAGG 840 
GAACTTACCC GGTTGCTCCT TTTCTArCTT CTTGCTOGCC CTGCTGTCCT GCATCACCAC 900 
CCCGGTCTCC GCTGCCGAAG TGAAGAACAT CAGTACC6GC TACATGGTGA CCAACGACTG 960 
CACCAATGAT AGCATTACCT GGCAACTCCA GGCTGCTGrC CTCCACGTCC CCGGGfGCGT 1020 
CCCGTGCGAG AAAGT6GGGA ATACATCTCG GTGCTGGATA CC6GTCTCAC CGAATGTGGC 1080 
CGTGCAGCAG CCCGGCGCCC TCACGCAGGG CTTACG6ACG CACATTGACA TGGTTGfGAT 1140 
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GTCCGCCACG C7CTGC7CCG CTCTTTACGT GGG6GACC7C TGC6GTGGGG TGA7GCT7GC 1200 
AGCCCAGA7G TTCATTGTCT CGCCACAGCA CCACTGGTTT GTGCAAGACT GCAATTGC7C 1260 
CA7C7ACCCT GGTACCATCA CTGGACACCG CATGGCGTGG GACATGATGA TGAACTGGTC 1320 
GCCCACGGCT ACCATGATCC T6GCGTACGC GATGCGCGTC CCCGAGGTCA 7CA7AGACA7 1380 
CATT6GCGG6 GC7CA77GGG GCG7CA7G7T CGGCTTAGCC TACTTCTCTA TGCAGGGAGC 1440 
GTGGGCAAAA GTCSTTGTCA TTCTTTTGCT GGCCGCCGGG G7GGACGCGC AAACCCATAC 1500 
CG77GGGGG7 TCTACCGCGC ATAAC6CCAG GACCCTCACC GGCATGTTCT CCCTTGGTGC 1560 
CAGGCAGAAA ATCCAGCTCA TCAACACCAA TGGCAGTTGG CACATCAACC GCACCGCCCT 1620 
GAACTGCAAT GACTCTTTGC ACACCGGC77 CCTCGCGTCA CTGTTCTACA CCCACAGCTT 1680 
CAACTCGTCA GGATGTCCCG AACGCATGTC CGCCTGCCGC AGTATCGAGG CCTTTCGGGT U40 
GGGATGGGGC 6CCTTACAAT ATGAGGACAA TGTCACCAAT CCAGAGGATA TGAGACCGTA 1400 
TTGCTGGCAC TACCCACCAA GACAGTGTG6 TGTAGTCTCC GCGAGCTCTG 7GTGTGGCCC 1860 
AG7G7ACTGT TTCACCCCCA GCCCAGTAGT AGTGGGTACfi ACCGATAGAC TTGGAGCGCC 1920 
CACTTACACG TGGGGGGAGA ATGAGACAGA TGTCTTCCTA TTGAACAGCA CTCGACCACC 1980 
GCAGGGGTCA TGGTTCGGCT GCACG7GGA7 GAACTCCACT GGCTACACCA AGACTTGCGG 2040 
CGCACCACCC TGCCGCATTA GAGCTGACTT CAATGCCAGC ATGGACTTGT TGTGCCCCAC 2100 
GGACTGTm AGGAAGCATC CTGATACCAC CTACATCAAA TGFGGCTCTG GGCCCTGGCT 2160 
CACGCCAA6G TGCCTGATCG ACTACCCCTA CAGGCTCTGG CATTACCCCT GCACAGTTAA 2220 
CTATACCATC TTCAAAATAA GGATGTATGT GG6GGGGGTC GAGCACAGGC TCACGGCTGC 2280 
GTGCAATTTC ACTCGTGGGG ATCGTTGCAA CTTGGAGGAC AGAGACAGAA GTCAACTGTC 2340 
TCCTTTGCTG CACTCCACCA CGGAGTGGGC CA777TACCT TGCACTTACT CGGACCTGCC 2400 
CGCCTTGTCG ACTGGTCTTC TCCACCTCCA CCAAAACATC GTGGACGTGC AATTCATGTA 2460 
TGGCCTATCA CCTGCTCTCA CAAAATACAT CGTCCGATGG GAGTGG6TAG TACTCTTATT 2520 
CCTGCTCTTA GCGGACGCCA GGGITTGCGC CTGCTTAT6G ATGCTCATCT TGTTGGGCCA 2580 
GGGCGAAGCA GCACTAGAGA AGTTGGTCGT CTTGCACGCT GC6AGCGCAG CTAGCTGCAA 2640 
TGGCTTCCTA TACTTTGTCA TCTTTTTCGT GGCTGCTTGG TACATCAAGG GTCGGGTAGT 2700 
CCCCTTGGCr ACTTATTCCC TCACTGGCCT ATGGTCCTTT GGCCTACTGC TCC7AGCAT7 2760 
GCCCCAACAG GC77A7GC77 A7GACGCA7C 7G7ACATGGT CAGATAGGAG CAGC7CFGT7 2820 
GG7ACTGA7C AC7C7C7TTA CAC7CACCCC CGGG7A7AAG ACCCT7CTCA GCCGGTT7C7 2880 
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GTGGT6GTTG TGCTATCTTC TGACCCTGGC GGAAGCFATG GTCCAGGAGT GGGCACCACC 2940 
fATGCAGGTG CGCGGTGGCC GTGATGGGAT CATATGGGCC GTCflCCATAT 1CTGCCCGGG 3000 
TGTGGTGTTT GACATAACCA AGTGGCTCTT GGCGGTGCTT GGGCCTGCTT ATCTCCTAAA 3060 
AGGTGCTTTG ACGCGTGTGC CGTACTTCGT CAGGGCTCAC GCTCTACTAA GGATGTGCAC 3120 
CATGGTAAGG CATCTCGCGG GGGGFAGGTA CGTCCAGATG GTGCTACTA6 CCCTTGGCAG 3180 
GTGGACTGGC ACTTACATCT ATGACCACCT CACCCCTATG TCGGATTGGG CTGCTAATGG 3240 
CCTGCGfiGAC TTGGCGGTCG CCGTGGAGCC TATCATCTTC AGTCCGATGG AGAAAAAA6T 3300 
CATCGTCTGG GGAGCGGAGA CAGCTGCTTG CGGGGATATC TTACACGGAC TTCCCGTGTC 3360 
CGCCCGACTT GGCCGGGAGG TCCTCCTTGG CCCAGCTGAT GGCTATACCT CCAAGGGGTG 3420 
GAGTCTTCTC GCCCCCATCA CTGCTTATGC CCAQCAGACA CQCGGCCTTr TGG6CACCAT 3480 
AGTGGTGAGC ATGACGGGGC GCGACAAGAC AGAACAGGCC GGGGAGATTC AGGTCCTGTC 3540 
CACGGTCACT CAGTCCnCC TCGGAACAAC CATCTCGGGG GTCTTATGGA CTGTCTACCA 3G00 
TGGAGCTGGC AACAAGAC7C TAGCCGGCTC ACGGGGTCCG GTCACACAGA TGTACTCCAG 3660 
TGCTGAGGGG GACTTAGTGG GGTGGCCCAG CCCCCCCGGG ACCAAATCTT fGGAGCCGTG 3720 
CACGTGTGGA GCGGTCGACC TATACCTGGT CACGCGAAAC GCTGATGTCA TCCCGGCTCG 3780 
AAGACGCGGG GACAAGCGAG GAGCGCTACT CTCCCCGAGA CCTCTTTCCA CCTTGAAGGG 3840 
GTCCTCGGGG GGCCCGGTGC TCTGCCCCAG AGGCCACGCT GTCGGGGTCT TCCGGGCA6C 3900 
CGTGTGCTCC CGGGGCGTGG CCAAGTCCAT AGATTTTATC CCCGTTGAGA CACTT6ACAT 3960 
CGTCACTCGG TCCCCCACCT TTAGTGACAA CAGCACACCA CCTGCTGTGC CCCAAACTTA 4020 
TCAGGTC6GG TACTTACATG CCCCGACTGG TAGTGGAAAG AGCACCAAAG TCCCTGTCGC 4080 
GTATGCCGCT CAGGGGTACA AAGTGCfAGT GCTTAATCCC TCGGTGGCTG CCACCCTGGG 4140 
GTTTGGGSCG TACTTGTCCA AGGCACATGG CATCAATCCC AACATTAGGA CTGGSGTCAG 4200 
GACTGTGACG ACCGGGGCGC CCATCACGTA CTCCACATAT GGCAAATTCC TCGCCGATGG 4260 
GG6CTGCGCA GGCG6CGCCT ATGACATCAT CATATGCGAf GAATGCCATG CCGTGGACTC 4320 
TACCACCATT CTC6GCATCG 6AACAGTCCT CGATCAAGCA GAGACAGCCG GGGTCAGGCT 4380 
AACTGTACTG GCTACGGCTA CGCCCCCCGG GTCAGTGACA ACCCCCCACC CCAACATAGA 4440 
GGAGGTGGCC CTCGGGCAGG AGGGTGAGAT CCCCTTCTAT GGGAGGGCGA TTCCCCTGTC 4500 
ATACATCAAG GGAGGAAGAC ACTTfiATCTT CTGCCACTCA AAGAAAAAGT GTGACGAGCT 4560 
CGCGGCGGCC CTTCGGGGTA TGGGCTTGAA CGCAGTGGCA TACTACAGAG GGCTGGACGT 4620 
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CTCC6TAATA CCAACTCA6G GAGACGTAGT GGTCGTCGCC ACCGACGCCC TCATGACGGG 4680 
GTTTACrWA GACrTTGACT CCGTGATCGA C7GCAACGTA GCGGTCACTC AAGTTGTAGA 4740 
CTTCAGCTTG GACCCCACAT TCACCATAAC CACACAGACT 6TCCCTCAAG ACGCTGTCTC 4800 
ACGTAGCCAG CGCCGGGGCC GCACGGGCAG GGGAAGACTG GGTATTTATA GGTATGTTTC 4860 
CACTGGTGAG CGAGCCTCAG GAATGTTTGA CAGTGTAGTG CTCTGCGAGT GCTACGATGC 4920 
AGGGGCCGCA TGGTATGAGC TCACACCAGC GGAGACCACC GTCAGGCTCA GAGCATATTT 4980 
CAACACACCT GGTTTGCCTG TGTGCCAAGA CCATCTTGAG 7TFTGGGAGG CAGT7TTCAC 5Q40 
CGGCCTCACA CACATAGATG CCCACTTCCT TTCCCAAACA AAGCAATCGG GGGAAAATTT 5100 
CGCATACTTA ACAGCCTACC AGGCTACAGT GTGCGCTAGG GCCAAAGCCC CCCCCCCGTC 5160 
CTGGGACGTC ATGTGGAA6T GTTTGACTCG ACTCAAGCCC ACACTCGTGG GCCCCACACC 5220 
TCTCCTGTAC CGCT73G6CT CTGTTACCAA CGAGGTCACC CTCACGCATC CTGTGACGAA 5280 
ATACATCGCC ACCTGCATGC AAGCCGACCT TGAGGTCAT/G ACCAGCACGT GGGTCTTAGC 5340 
TGGGGGGGTC TTGGCGGCCG TCGCCGCGTA CTGCCTGGCG ACCGGGTGTG TTTGCATCAT 5400 
CGGCCGCTTG CACGTTAACC AGCGAGCCGT CGTTGCACCG GACAAGGAGG TCCTCTATGA 5460 
GGCTTTTGAT GAGATGGAGG AATGTGCCTC TAGAGCGGCT CTCATTGAAG AGGGGCAGCG 5520 
GATAGCCGAG ATGCTGAAGT CCAAGATCCA AGGCTTATTG CAGCAA6CTT CCAAACAAGC 5580 
TCAAGACATA CAACCCGCTG TGCAG6CTTC TTGGCCCAAG GTAGAGCAAT TCTGG6CCAA 5640 
ACACATGTGG AACTTCAFCA GCGGCATTCA ATACCTCGCA GGACTATCAA CACTGCCAGG 5700 
GAACCCT6CT GTAGCTTCCA TGATGGCATT DAGTGCCGCC CTCACCAGTC CGTTGTCAAC 5760 
TAGCACCACT ATCCTTCTCA ACATTTTGGG GGGCTGGCTA GCATCCCAAA TTGCGCCTCC 5820 
CGCGGGGGCT ACCGGCTTCG TCGTCAGTGG CCTGGTGGGG GCfGCCGTAG GCAGCATAGG 5880 
CTTGGGTAAG GTGCTGGrGG ACATCCTGGC AGGGTATGGT GCGGGCATTT CGGGGGCTCT 5940 
CGTCGCATTC AAGATCATGT CTGGC6AGAA GCCCTCCATG GAGGATGTTG TCAACCTGCT 6000 
GCCTGGAATT CTGTCTCCGG GTGCCCTGGT GGTGGGAGTC ATCTGCGCGG CCATCCTGCG 6060 
CCGACACGTC GGACCGGGGG AAGGCGCTGT CCAATGGATG AATAGGCTCA TTGCCTTTGC 6120 
TTCCAGAGGA AACCACGTCG CCCCCACCCA CTACGTGACG GAGTCGGATG CGTCGCAGCG 6180 
TGIGACCCAA CTACTTGGCT CCCTTACCAT AACCAGCCTG CTCAGGAGAC TCCACAACTG 6240 
GATTACTGAA GACTGCCCCA TCCCATGCAG CGGCTCGTGG CTCCGCGATG TGTGGGATTG 6300 
GGTTTGCACC ATCCTAACAG ACTTTAAAAA CTGGCTGACC TCCAAATTGT TCCCAAAGAT 6360 
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GCCTGGTCTC CCCTTTATCT CTTGTCAAAA GGGGTACAAG GGCGTGTGGG CTGGCACTGG 6420 
TATCATGACC ACACGGTGTC CTTGCGGCGC CAATATCTCT GGCAAT6TCC GCCT6GGCTC 6480 
CATGAGAATT ACGGGGCCCA AAACCTGCAT GAATATCTGG CA6GGGACCT TTCCCATCAA 6540 
TTGTTACACG GAGGGCCAGT GCGTGCCGAA ACCCGCACCA AACTTTAAGA TCGCCATCTG 6600 
GAG6GTGGCG GCCTCA6AGT ACGCGGAGGT GACGCAGCAC GGGTCATACC ACTACATAAC 6660 
AGGACTTACC ACTGATAACT TGAAAGTTCC TTGCCAACTA CCTTCTCCAG AGTTCTTTTC 6720 
CTG6GTG6AC GGAGTGCAflA TCCATAGGTT TGCCCCCATA CCGAAGCCGT TTTTTC6GGA 6780 
TGAGGTCTCG TTCTGCGTTG GGCTTAATTC ATTTGTCGTC GGGTCTCAGC TCCCTTGCGA 6840 
TCCrGAACCT GACACAGACG TATTGACGTC CATGCTAACA GACCCATCCC ATATCACGGC 6900 
GGAGACTGCA GCGCGGCGTT TGGCACGGGG GTCACCCCCG TCCGAGGCAA GCTCCTCAGC 6960 
GAGCCAGCTA TCGGCACCAT CGCTGCGAGC CACCTGCACC ACCCAC6BCA AGGCCTATGA 7020 
1GTGGACATG GTGGATGCCA ACCTflTTCAT GGGGGGCGAT GTGACCCGGA TAGAGTCT6A 7080 
GTCCAAAGTG GTCGTTCTGG ACTCTCTCGA CCCAATGGTC GAAGAAAGGA GCGACCTTGA 7140 
GCCTTCGATA CCATCG6AA7 ATATGCTCCC CAAGAAGAGA TTCCCACCAG CCTTACCGGC 7200 
TTGBGCACGG CCTGATTACA ACCCACCGCT TGTGGAATCG TGGAAGAGGC CAGATTACCA 7260 
ACCGGCCACT GTTGCGGGC7 GCGCTCTCCC CCCCCCTAAG AAAACCCCGA CGCCTCCCCC 7320 
AAGGAGACGC CGGACAGTGG GTCTGAGTGA GAGCTCCATA GCAGATGCCC TACAACAfiCT 7380 
GGCCATCAAG TCCTTTGGCC AGCCCCCCCC AAGC6GCGAT TCAGGCCTTT CCACGGGGGC 7440 
GGACGCAGCC GATTCCGGCA GTCGGACGCC CCCCGATGAG TTGGCCCTTT CGGAGACAGG 7500 
TTCCATCTCC TCCATGCCCC CTCTCGAGGG GGA6CC7GGA GATCCAGACT TGGAGCCTGA 7560 
GCAGGTAGAG CTTCAACCTC CCCCCCAGGG GGGGGTGGTA ACCCCCGGCT CAGGCTCGGG 7620 
GTCTT6GTCT ACTTGCTCCG AGGAGGACGA CTCCGTCGTG TGCTGCfCCA TCTCATACTC 7680 
CTGGACCGGG GCTCTAATAA CTCCTIGTAG CCCCGAAGAG GAAAAGTTGC CAATTGGCCC 7740 
CTTGAGCAAC TCCCTGTTGC GATATCACAA CAAGGTGTAC TGTACCACAT CAAAGAGCGC 7800 
CTCA1TAAGG GCTAAAAAGG TAACTTTTGA TAGGATGCAA GCGCTCGACG CTCA7TATGA 7860 
CTCAGTCTTG AAGGACATTA AGCTAGCGGC CTCCAAGGTC ACC6CAAGGC 7TCTCACTTT 7920 
AGAGGAfiGCC TGCCAGTTAA CTCCACCCCA CTCTGCAAGA TCCAAGFATG GGTTTGGGGC 7980 
TAAGGAGGTC CGCAGCTTGT CCGGGAGAGC CGTTAACCAC ATCAAGTCCG FGTGGAAGGA 8040 
CCTCCTGGAA GACACACAAA CACCAATTCC TACAACCATC ATGGCCAAAA ATGAGGTGT7 8100 
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CTGCGTGGAC CCCACCAAGG G6GGTAAGAA AGCAGCTCGC CTTATCGTTT ACCCTGACCT 8160 
CGGCGTCAGG GTCTGCGAGA AAATGGCCCT TTATGATATC ACACAAAAGC TTCCTCAGGC 8220 
GGTGAT6GGG GCTTCTTATG GATTCCAGTA CTCCCCCGCT CAGCGGGTGG AGTTTCTCTT 8280 
GAAGGCATGG GCGGAAAAGA AAGACCCTAT GGGTTTTTCG TATGATACCC GATGCTTTGA 8840 
CTCAACCGTC ACTGAGAGAG ACATCAGGAC TGAGGAGTCC ATATATCGGG CTTGTTCCTT 8400 
GCCCGAGGAG GCCCACACTG CCATACACTC ACTGACTGAG AGACTTTACG TGGGAGGGCC 8460 
CATGTTCAAC AGCAAGGGCC AGACCTGCGG GTACAGGCGT TGCCGCGCCA GCGGGGTGCT 8520 
TACCACTAGC ATGGGGAACA CCATCACATG CTATGTGAAA GCCTTAGCGG CCTGTAAGGC 8580 
TGCAGGGATA ATTGCGCCCA CAATGCTGGT ATGCGGCGAT GACTTGGTTG TCATCTCAGA 8640 
GAGCCAGGGG ACCGAGGAGG ACGAGCGGAA CCTGAGAGCC TTCACGGAGG CTATGACCAG 8700 
GTATTCTGCC CCTCCTGGTG ACCCCCCCAG ACCGGAATAT GACCTGGAGC TGATAACATC 8760 
TTGCTCCTCA AATGTGTCTG TGGCGTTGGG CCCACAAGGC CGCCGCAGAT ACTACCTGAC 8820 
CAGAGACCCT ACCACTCCAA TCGCCCGGGC TGCCTGGGAA ACAGTTAGAC ACTCCCCTGT 8880 
CAATTCATGG CTAGGAAACA TCATCCAGTA CGCCCCAACC ATATGGGCTC GCATGGTCCT 8940 
GATGACACAC TTCTTCTCCA TTCTCATGGC CCAAGATACT CTGGACCAGA ACCTCAACTT 9000 
TGAGATGTAC GGAGCGGTGT ACTCCGTGAG TCCCTTGGAC CTCCCAGCCA TAATTGAAAG 9060 
GTTACACGGG CTTGACGCTT TCTCTCTGCA CACATACACT CCCCACGAAC TGACACGGGT 9120 
GGCTTCAGCC CTCAGAAAAC TTGGGGCGCC ACCCCTCAGA GCGTGGAAGA GCCGGGCACG 9180 
TGCAGTCAGG GCGTCCCTCA TCTCCCGTGG GGGGAGAGCG GCCGTTTGCG GCCGATATCT 9240 
CTFCAACTGG GCGGTGAAGA CCAAGCTCAA AC7CACTCCA TTGCCGGAAG CGCGCCTCCT 9300 
GGATTTATCC AGCTGGTTCA CTGTCGGCGC CGGCGGGGGC GACATTTATC ACAGCGTGTC 9360 
GCGTGCCCGA CCCCGCTTAT TACTCCTTGG CCTACTCCTA CTTTTTGTAG GGGTAGGCCT 9420 
TTTCCTACTC CCCGCTCGGT AGAGCGGCAC ACATTAGCTA CACTCCATAG CTAACTGTCC 9480 
CTFTTTTTir TTTTTTTTTT TTTTTTTTTT TTTTTTTm TTTTTTTTTT TTTTTTTTTT 9540 
TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTT 9589 
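
The listing rows above carry a running base count at the end of each row, and SEQ ID NO: 2 is given as the cDNA counterpart of the genomic RNA in SEQ ID NO: 1. A minimal sketch of how those two properties could be checked is shown below. The function names `parse_listing` and `rna_to_cdna` are hypothetical helpers introduced for illustration, and the two sample rows are the first rows of each listing with apparent OCR damage corrected, which is an assumption rather than a certified transcription.

```python
def parse_listing(rows):
    """Join the 10-base groups of each row and check the running count."""
    sequence = ""
    for row in rows:
        # The last whitespace-separated field is the cumulative base count.
        *groups, count = row.split()
        sequence += "".join(groups)
        if len(sequence) != int(count):
            raise ValueError(
                f"row ending at {count}: {len(sequence)} bases read so far"
            )
    return sequence


def rna_to_cdna(rna):
    """Write an RNA sequence in DNA letters (U -> T), same strand sense."""
    return rna.replace("U", "T")


# First two rows of SEQ ID NO: 1 (RNA) and SEQ ID NO: 2 (cDNA),
# transcribed with OCR damage corrected (assumption, see lead-in).
rna_rows = [
    "ACCCGCCCCU AAUAGGGGCG ACACUCCGCC AUGAACCACU CCCCUGUGAG GAACUACUGU 60",
    "CUUCACGCAG AAAGCGUCUA GCCAUGGCGU UAGUAUGAGU GUCGUACAGC CUCCAGGCCC 120",
]
cdna_rows = [
    "ACCCGCCCCT AATAGGGGCG ACACTCCGCC ATGAACCACT CCCCTGTGAG GAACTACTGT 60",
    "CTTCACGCAG AAAGCGTCTA GCCATGGCGT TAGTATGAGT GTCGTACAGC CTCCAGGCCC 120",
]

rna = parse_listing(rna_rows)
cdna = parse_listing(cdna_rows)

# The two listings agree if the RNA, rewritten with T for U, equals the cDNA.
print(rna_to_cdna(rna) == cdna)
```

Running the same check over complete listings would flag any row whose group count disagrees with its printed index, which is a quick way to locate rows that lost or gained characters during extraction.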

[0023] IE5>J$^ : 3 K?l|0>HSg rcDNA to genomic RNA 

MOTS* : 397 0 LtzH& : E 

K5))<nS : 40 i3Ji] 
Number of strands: single-stranded

GGCATTACCC CTGCACAGTT AACTATACCA TCTTCAAAAT AAGGATGTAT GTGGGGGGGG 60
TCGAGCACAG GCTCACGGCT GCGTGCAATT TCACTCGTGG GGATCGTTGC AACTTGGAGG 120
ACAGAGACAG AAGTCAACTG TCTCCTTTGC TGCACTCCAC CACGGAGTGG GCCATTTTAC 180
CTTGCACTTA CTCGGACCTG CCCGCCTTGT CGACTGGTCT TCTCCACCTC CACCAAAACA 240
TCGTGGACGT GCAATTCATG TATGGCCTAT CACCTGCTCT CACAAAATAC ATCGTCCGAT 300
GGGAGTGGGT AGTACTCTTA TTCCTGCTCT TAGCGGACGC CAGGGTTTGC GCCTGCTTAT 360
GGATGCTCAT CTTGTTGGGC CAGGCCGAAG CAGCACTAGA GAAGTTGGTC GTCTTGCACG 420
CTGCGAGCGC AGCTAGCTGC AATGGCTTCC TATACTTTGT CATCTTTTTC GTGGCTGCTT 480
GGTACATCAA GGGTCGGGTA GTCCCCTTGG CTACTTATTC CCTCACTGGC CTATGGTCCT 540
TTGGCCTACT GCTCCTAGCA TTGCCCCAAC AGGCTTATGC TTATGACGCA TCTGTACATG 600
GTCAGATAGG AGCAGCTCTG TTGGTACTGA TCACTCTCTT TACACTCACC CCCGGGTATA 660
AGACCCTTCT CAGCCGGTTT CTGTGGTGGT TGTGCTATCT TCTGACCCTG GCGGAAGCTA 720
TGGTCCAGGA GTGGGCACCA CCTATGCAGG TGCGCGGTGG CCGTGATGGG ATCATATGGG 780
CCGTCGCCAT ATTCTGCCCG GGTGTGGTGT TTGACATAAC CAAGTGGCTC TTGGCGGTGC 840
TTGGGCCTGC TTATCTCCTA AAAGGTGCTT TGACGCGTGT GCCGTACTTC GTCAGGGCTC 900
ACGCTCTACT AAGGATGTGC ACCATGGTAA GGCATCTCGC GGGGGGTAGG TACGTCCAGA 960
TGGTGCTACT AGCCCTTGGC AGGTGGACTG GCACTTACAT CTATGACCAC CTCACCCCTA 1020
TGTCGGATTG GGCTGCTAAT GGCCTGCGGG ACTTGGCGGT CGCCGTGGAG CCTATCATCT 1080
TCAGTCCGAT GGAGAAAAAA GTCATCGTCT GGGGAGCGGA GACAGCTGCT TGCGGGGATA 1140
TCTTACACGG ACTTCCCGTG TCCGCCCGAC TTGGCCGGGA GGTCCTCCTT GGCCCAGCTG 1200
ATGGCTATAC CTCCAAGGGG TGGAGTCTTC TCGCCCCCAT CACTGCTTAT GCCCAGCAGA 1260
CACGCGGCCT TTTGGGCACC ATAGTGGTGA GCATGACGGG GCGCGACAAG ACAGAACAGG 1320
CCGGGGAGAT TCAGGTCCTG TCCACGGTCA CTCAGTCCTT CCTCGGAACA ACCATCTCGG 1380
GGGTCTTATG GACTGTCTAC CATGGAGCTG GCAACAAGAC TCTAGCCGGC TCACGGGGTC 1440
CGGTCACACA GATGTACTCC AGTGCTGAGG GGGACTTAGT GGGGTGGCCC AGCCCCCCCG 1500
GGACCAAATC TTTGGAGCCG TGCACGTGTG GAGCGGTCGA CCTATACCTG GTCACGCGAA 1560
ACGCTGATGT CATCCCGGCT CGAAGACGCG GGGACAAGCG AGGAGCGCTA CTCTCCCCGA 1620
GACCTCTTTC CACCTTGAAG GGGTCCTCGG GGGGCCCGGT GCTCTGCCCC AGAGGCCACG 1680
CTGTCGGGGT CTTCCGGGCA GCCGTGTGCT CCCGGGGCGT GGCCAAGTCC ATAGATTTTA 1740
TCCCCGTTGA GACACTTGAC ATCGTCACTC GGTCCCCCAC CTTTAGTGAC AACAGCACAC 1800
CACCTGCTGT GCCCCAAACT TATCAGGTCG GGTACTTACA TGCCCCGACT GGTAGTGGAA 1860
AGAGCACCAA AGTCCCTGTC GCGTATGCCG CTCAGGGGTA CAAAGTGCTA GTGCTTAATC 1920
CCTCGGTGGC TGCCACCCTG GGGTTTGGGG CGTACTTGTC CAAGGCACAT GGCATCAATC 1980
CCAACATTAG GACTGGGGTC AGGACTGTGA CGACCGGGGC GCCCATCACG TACTCCACAT 2040
ATGGCAAATT CCTCGCCGAT GGGGGCTGCG CAGGCGGCGC CTATGACATC ATCATATGCG 2100
ATGAATGCCA TGCCGTGGAC TCTACCACCA TTCTCGGCAT CGGAACAGTC CTCGATCAAG 2160
CAGAGACAGC CGGGGTCAGG CTAACTGTAC TGGCTACGGC TACGCCCCCC GGGTCAGTGA 2220
CAACCCCCCA CCCCAACATA GAGGAGGTGG CCCTCGGGCA GGAGGGTGAG ATCCCCTTCT 2280
ATGGGAGGGC GATTCCCCTG TCATACATCA AGGGAGGAAG ACACTTGATC TTCTGCCACT 2340
CAAAGAAAAA GTGTGACGAG CTCGCGGCGG CCCTTCGGGG TATGGGCTTG AACGCAGTGG 2400
CATACTACAG AGGGCTGGAC GTCTCCGTAA TACCAACTCA GGGAGACGTA GTGGTCGTCG 2460
CCACCGACGC CCTCATGACG GGGTTTACTG GAGACTTTGA CTCCGTGATC GACTGCAACG 2520
TAGCGGTCAC TCAAGTTGTA GACTTCAGCT TGGACCCCAC ATTCACCATA ACCACACAGA 2580
CTGTCCCTCA AGACGCTGTC TCACGTAGCC AGCGCCGGGG CCGCACGGGC AGGGGAAGAC 2640
TGGGTATTTA TAGGTATGTT TCCACTGGTG AGCGAGCCTC AGGAATGTTT GACAGTGTAG 2700
TGCTCTGCGA GTGCTACGAT GCAGGGGCCG CATGGTATGA GCTCACACCA GCGGAGACCA 2760
CCGTCAGGCT CAGAGCATAT TTCAACACAC CTGGTTTGCC TGTGTGCCAA GACCATCTTG 2820
AGTTTTGGGA GGCAGTTTTC ACCGGCCTCA CACACATAGA TGCCCACTTC CTTTCCCAAA 2880
CAAAGCAATC GGGGGAAAAT TTCGCATACT TAACAGCCTA CCAGGCTACA GTGTGCGCTA 2940
GGGCCAAAGC CCCCCCCCCG TCCTGGGACG TCATGTGGAA GTGTTTGACT CGACTCAAGC 3000
CCACACTCGT GGGCCCCACA CCTCTCCTGT ACCGCTTGGG CTCTGTTACC AACGAGGTCA 3060
CCCTCACGCA TCCTGTGACG AAATACATCG CCACCTGCAT GCAAGCCGAC CTTGAGGTCA 3120
TGACCAGCAC GTGGGTCTTA GCTGGGGGGG TCTTGGCGGC CGTCGCCGCG TACTGCCTGG 3180
CGACCGGGTG TGTTTGCATC ATCGGCCGCT TGCACGTTAA CCAGCGAGCC GTCGTTGCAC 3240
CGGACAAGGA GGTCCTCTAT GAGGCTTTTG ATGAGATGGA GGAATGTGCC TCTAGAGCGG 3300
CTCTCATTGA AGAGGGGCAG CGGATAGCCG AGATGCTGAA GTCCAAGATC CAAGGCTTAT 3360
TGCAGCAAGC TTCCAAACAA GCTCAAGACA TACAACCCGC TGTGCAGGCT TCTTGGCCCA 3420
AGGTAGAGCA ATTCTGGGCC AAACACATGT GGAACTTCAT CAGCGGCATT CAATACCTCG 3480
CAGGACTATC AACACTGCCA GGGAACCCTG CTGTAGCTTC CATGATGGCA TTCAGTGCCG 3540
CCCTCACCAG TCCGTTGTCA ACTAGCACCA CTATCCTTCT CAACATTTTG GGGGGCTGGC 3600
TAGCATCCCA AATTGCGCCT CCCGCGGGGG CTACCGGCTT CGTCGTCAGT GGCCTGGTGG 3660
GGGCTGCCGT AGGCAGCATA GGCTTGGGTA AGGTGCTGGT GGACATCCTG GCAGGGTATG 3720
GTGCGGGCAT TTCGGGGGCT CTCGTCGCAT TCAAGATCAT GTCTGGCGAG AAGCCCTCCA 3780
TGGAGGATGT TGTCAACCTG CTGCCTGGAA TTCTGTCTCC GGGTGCCCTG GTGGTGGGAG 3840
TCATCTGCGC GGCCATCCTG CGCCGACACG TGGGACCGGG GGAAGGCGCT GTCCAATGGA 3900
TGAATAGGCT CATTGCCTTT GCTTCCAGAG GAAACCACGT CGCCCCCACC CACTACGTGA 3960
CGGAGTCGGA 3970

[0024] SEQ ID NO: 4
Sequence length: 2693
Sequence type: nucleic acid
Strandedness:
Topology: linear
Molecule type: cDNA to genomic RNA
Feature determination method: E
Sequence
ATTCTGTCTC CGGGTGCCCT GGTGGTGGGA GTCATCTGCG CGGCCATCCT GCGCCGACAC 60
GTGGGACCGG GGGAAGGCGC TGTCCAATGG ATGAATAGGC TCATTGCCTT TGCTTCCAGA 120
GGAAACCACG TCGCCCCCAC CCACTACGTG ACGGAGTCGG ATGCGTCGCA GCGTGTGACC 180
CAACTACTTG GCTCCCTTAC CATAACCAGC CTGCTCAGGA GACTCCACAA CTGGATTACT 240
GAAGACTGCC CCATCCCATG CAGCGGCTCG TGGCTCCGCG ATGTGTGGGA TTGGGTTTGC 300
ACCATCCTAA CAGACTTTAA AAACTGGCTG ACCTCCAAAT TGTTCCCAAA GATGCCTGGT 360
CTCCCCTTTA TCTCTTGTCA AAAGGGGTAC AAGGGCGTGT GGGCTGGCAC TGGTATCATG 420
ACCACACGGT GTCCTTGCGG CGCCAATATC TCTGGCAATG TCCGCCTGGG CTCCATGAGA 480
ATTACGGGGC CCAAAACCTG CATGAATATC TGGCAGGGGA CCTTTCCCAT CAATTGTTAC 540
ACGGAGGGCC AGTGCGTGCC GAAACCCGCA CCAAACTTTA AGATCGCCAT CTGGAGGGTG 600
GCGGCCTCAG AGTACGCGGA GGTGACGCAG CACGGGTCAT ACCACTACAT AACAGGACTT 660
ACCACTGATA ACTTGAAAGT TCCTTGCCAA CTACCTTCTC CAGAGTTCTT TTCCTGGGTG 720
GACGGAGTGC AGATCCATAG GTTTGCCCCC ATACCGAAGC CGTTTTTTCG GGATGAGGTC 780
TCGTTCTGCG TTGGGCTTAA TTCATTTGTC GTCGGGTCTC AGCTCCCTTG CGATCCTGAA 840
CCTGACACAG ACGTATTGAC GTCCATGCTA ACAGACCCAT CCCATATCAC GGCGGAGACT 900
GCAGCGCGGC GTTTGGCACG GGGGTCACCC CCGTCCGAGG CAAGCTCCTC AGCGAGCCAG 960
CTATCGGCAC CATCGCTGCG AGCCACCTGC ACCACCCACG GCAAGGCCTA TGATGTGGAC 1020
ATGGTGGATG CCAACCTGTT CATGGGGGGC GATGTGACCC GGATAGAGTC TGAGTCCAAA 1080
GTGGTCGTTC TGGACTCTCT CGACCCAATG GTCGAAGAAA GGAGCGACCT TGAGCCTTCG 1140
ATACCATCGG AATATATGCT CCCCAAGAAG AGATTCCCAC CAGCCTTACC GGCTTGGGCA 1200
[illegible] ACAACCCACC GCTTGTGGAA TCGTGGAAGA GGCCAGATTA CCAACCGGCC 1260
ACTGTTGCGG GCTGCGCTCT CCCCCCCCCT AAGAAAACCC CGACGCCTCC CCCAAGGAGA 1320
CGCCGGACAG TGGGTCTGAG TGAGAGCTCC ATAGCAGATG CCCTACAACA GCTGGCCATC 1380
AAGTCCTTTG GCCAGCCCCC CCCAAGCGGC GATTCAGGCC TTTCCACGGG GGCGGACGCA 1440
GCCGATTCCG GCAGTCGGAC GCCCCCCGAT GAGTTGGCCC TTTCGGAGAC AGGTTCCATC 1500
TCCTCCATGC CCCCTCTCGA GGGGGAGCCT GGAGATCCAG ACTTGGAGCC TGAGCAGGTA 1560
GAGCTTCAAC CTCCCCCCCA GGGGGGGGTG GTAACCCCCG GCTCAGGCTC GGGGTCTTGG 1620
TCTACTTGCT CCGAGGAGGA CGACTCCGTC GTGTGCTGCT CCATGTCATA CTCCTGGACC 1680
GGGGCTCTAA TAACTCCTTG TAGCCCCGAA GAGGAAAAGT TGCCAATTGG CCCCTTGAGC 1740
AACTCCCTGT TGCGATATCA CAACAAGGTG TACTGTACCA CATCAAAGAG CGCCTCATTA 1800
AGGGCTAAAA AGGTAACTTT TGATAGGATG CAAGCGCTCG ACGCTCATTA TGACTCAGTC 1860
TTGAAGGACA TTAAGCTAGC GGCCTCCAAG GTCACCGCAA GGCTTCTCAC TTTAGAGGAG 1920
GCTTGCCAGT TAACTCCACC CCACTCTGCA AGATCCAAGT ATGGGTTTGG GGCTAAGGAG 1980
GTCCGCAGCT TGTCCGGGAG AGCCGTTAAC CACATCAAGT CCGTGTGGAA GGACCTCCTG 2040
GAAGACACAC AAACACCAAT TCCTACAACC ATCATGGCCA AAAATGAGGT GTTCTGCGTG 2100
GACCCCACCA AGGGGGGTAA GAAAGCAGCT CGCCTTATCG TTTACCCTGA CCTCGGCGTC 2160
AGGGTCTGCG AGAAAATGGC CCTTTATGAT ATCACACAAA AGCTTCCTCA GGCGGTGATG 2220
GGGGCTTCTT ATGGATTCCA GTACTCCCCC GCTCAGCGGG TGGAGTTTCT CTTGAAGGCA 2280
[illegible] AGAAAGACCC TATGGGTTTT TCGTATGATA CCCGATGCTT TGACTCAACC 2340
GTCACTGAGA GAGACATCAG GACTGAGGAG TCCATATATC GGGCTTGTTC CTTGCCCGAG 2400
GAGGCCCACA CTGCCATACA CTCACTGACT GAGAGACTTT ACGTGGGAGG GCCCATGTTC 2460
AACAGCAAGG GCCAGACCTG CGGGTACAGG CGTTGCCGCG CCAGCGGGGT GCTTACCACT 2520
AGCATGGGGA ACACCATCAC ATGCTATGTG AAAGCCTTAG CGGCCTGTAA GGCTGCAGGG 2580
ATAATTGCGC CCACAATGCT GGTATGCGGC GATGACTTGG TTGTCATCTC AGAGAGCCAG 2640
GGGACCGAGG AGGACGAGCG GAACCTGAGA GCCTTCACGG AGGCTATGAC CAG 2693

[0025] SEQ ID NO: 5
Met Ser Thr Asn Pro Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr
5 10 15

Asn Arg Arg Pro Gln Asp Val Lys Phe Pro Gly Gly Gly Gln Ile
20 25 30

Val Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly
35 40 45

Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gln Pro Arg Gly
50 55 60

Arg Arg Gln Pro Ile Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser
65 70 75

Trp Gly Lys Pro Gly Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly
80 85 90

Leu Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg Gly Ser Arg Pro
95 100 105

Ser Trp Gly Pro Asn Asp Pro Arg His Arg Ser Arg Asn Val Gly
110 115 120

Lys Val Ile Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met Gly
125 130 135

Tyr Ile Pro Val Val Gly Ala Pro Leu Gly Gly Val Ala Arg Ala
140 145 150

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Phe Ala
155 160 165

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser Ile Phe Leu Leu Ala
170 175 180

Leu Leu Ser Cys Ile Thr Thr Pro Val Ser Ala Ala Glu Val Lys
185 190 195

Asn Ile Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr Asn Asp
200 205 210

Ser Ile Thr Trp Gln Leu Gln Ala Ala Val Leu His Val Pro Gly
215 220 225

Cys Val Pro Cys Glu Lys Val Gly Asn Thr Ser Arg Cys Trp Ile
230 235 240

Pro Val Ser Pro Asn Val Ala Val Gln Gln Pro Gly Ala Leu Thr
245 250 255

Gln Gly Leu Arg Thr His Ile Asn Met Val Val Met Ser Ala Thr
260 265 270

Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met
275 280 285

Leu Ala Ala Gln Met Phe Ile Val Ser Pro Gln His His Trp Phe
290 295 300

Val Gln Asp Cys Asn Cys Ser Ile Tyr Pro Gly Thr Ile Thr Gly
305 310 315

His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Ala
320 325 330

Thr Met Ile Leu Ala Tyr Ala Met Arg Val Pro Glu Val Ile Ile
335 340 345

Asp Ile Ile Gly Gly Ala His Trp Gly Val Met Phe Gly Leu Ala
350 355 360

Tyr Phe Ser Met Gln Gly Ala Trp Ala Lys Val Val Val Ile Leu
365 370 375

Leu Leu Ala Ala Gly Val Asp Ala Gln Thr His Thr Val Gly Gly
380 385 390

Ser Thr Ala His Asn Ala Arg Thr Leu Thr Gly Met Phe Ser Leu
395 400 405

Gly Ala Arg Gln Lys Ile Gln Leu Ile Asn Thr Asn Gly Ser Trp
410 415 420

His Ile Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu His Thr
425 430 435

Gly Phe Leu Ala Ser Leu Phe Tyr Thr His Ser Phe Asn Ser Ser
440 445 450

Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser Ile Glu Ala Phe
455 460 465

Arg Val Gly Trp Gly Ala Leu Gln Tyr Glu Asp Asn Val Thr Asn
470 475 480

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gln
485 490 495

Cys Gly Val Val Ser Ala Ser Ser Val Cys Gly Pro Val Tyr Cys
500 505 510

Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly
515 520 525

Ala Pro Thr Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu
530 535 540

Leu Asn Ser Thr Arg Pro Pro Gln Gly Ser Trp Phe Gly Cys Thr
545 550 555

Trp Met Asn Ser Thr Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro
560 565 570

Cys Arg Ile Arg Ala Asp Phe Asn Ala Ser Met Asp Leu Leu Cys
575 580 585

Pro Thr Asp Cys Phe Arg Lys His Pro Asp Thr Thr Tyr Ile Lys
590 595 600

Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Leu Ile Asp Tyr
605 610 615

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Tyr Thr Ile
620 625 630

Phe Lys Ile Arg Met Tyr Val Gly Gly Val Glu His Arg Leu Thr
635 640 645

Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys Asn Leu Glu Asp
650 655 660

Arg Asp Arg Ser Gln Leu Ser Pro Leu Leu His Ser Thr Thr Glu
665 670 675

Trp Ala Ile Leu Pro Cys Thr Tyr Ser Asp Leu Pro Ala Leu Ser
680 685 690

Thr Gly Leu Leu His Leu His Gln Asn Ile Val Asp Val Gln Phe
695 700 705

Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr Ile Val Arg Trp
710 715 720

Glu Trp Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val
725 730 735

Cys Ala Cys Leu Trp Met Leu Ile Leu Leu Gly Gln Ala Glu Ala
740 745 750

Ala Leu Glu Lys Leu Val Val Leu His Ala Ala Ser Ala Ala Ser
755 760 765

Cys Asn Gly Phe Leu Tyr Phe Val Ile Phe Phe Val Ala Ala Trp
770 775 780

Tyr Ile Lys Gly Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr
785 790 795

Gly Leu Trp Ser Phe Gly Leu Leu Leu Leu Ala Leu Pro Gln Gln
800 805 810
800 805 810 
Ala Tyr Ala Tyr Asp Ala Ser Val His Gly Gln Ile Gly Ala Ala
815 820 825

Leu Leu Val Leu Ile Thr Leu Phe Thr Leu Thr Pro Gly Tyr Lys
830 835 840

Thr Leu Leu Ser Arg Phe Leu Trp Trp Leu Cys Tyr Leu Leu Thr
845 850 855

Leu Ala Glu Ala Met Val Gln Glu Trp Ala Pro Pro Met Gln Val
860 865 870

Arg Gly Gly Arg Asp Gly Ile Ile Trp Ala Val Ala Ile Phe Cys
875 880 885

Pro Gly Val Val Phe Asp Ile Thr Lys Trp Leu Leu Ala Val Leu
890 895 900

Gly Pro Ala Tyr Leu Leu Lys Gly Ala Leu Thr Arg Val Pro Tyr
905 910 915

Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met Val Arg
920 925 930

His Leu Ala Gly Gly Arg Tyr Val Gln Met Val Leu Leu Ala Leu
935 940 945

Gly Arg Trp Thr Gly Thr Tyr Ile Tyr Asp His Leu Thr Pro Met
950 955 960

Ser Asp Trp Ala Ala Asn Gly Leu Arg Asp Leu Ala Val Ala Val
965 970 975

Glu Pro Ile Ile Phe Ser Pro Met Glu Lys Lys Val Ile Val Trp
980 985 990

Gly Ala Glu Thr Ala Ala Cys Gly Asp Ile Leu His Gly Leu Pro
995 1000 1005

Val Ser Ala Arg Leu Gly Arg Glu Val Leu Leu Gly Pro Ala Asp
1010 1015 1020

Gly Tyr Thr Ser Lys Gly Trp Ser Leu Leu Ala Pro Ile Thr Ala
1025 1030 1035

Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly Thr Ile Val Val Ser
1040 1045 1050

Met Thr Gly Arg Asp Lys Thr Glu Gln Ala Gly Glu Ile Glu Val
1055 1060 1065

Leu Ser Thr Val Thr Gln Ser Phe Leu Gly Thr Thr Ile Ser Gly
1070 1075 1080

Val Leu Trp Thr Val Tyr His Gly Ala Gly Asn Lys Thr Leu Ala
1085 1090 1095

Gly Ser Arg Gly Pro Val Thr Gln Met Tyr Ser Ser Ala Glu Gly
1100 1105 1110

Asp Leu Val Gly Trp Pro Ser Pro Pro Gly Thr Lys Ser Leu Glu
1115 1120 1125

Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu Val Thr Arg Asn
1130 1135 1140

Ala Asp Val Ile Pro Ala Arg Arg Arg Gly Asp Lys Arg Gly Ala
1145 1150 1155

Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser Ser Gly
1160 1165 1170

Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Val Phe Arg
1175 1180 1185

Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser Ile Asp Phe Ile
1190 1195 1200

Pro Val Glu Thr Leu Asp Ile Val Thr Arg Ser Pro Thr Phe Ser
1205 1210 1215

Asp Asn Ser Thr Pro Pro Ala Val Pro Gln Thr Tyr Gln Val Gly
1220 1225 1230

Tyr Leu His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro
1235 1240 1245
1235 1240 1245 
Val Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro
1250 1255 1260

Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala
1265 1270 1275

His Gly Ile Asn Pro Asn Ile Arg Thr Gly Val Arg Thr Val Thr
1280 1285 1290

Thr Gly Ala Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala
1295 1300 1305

Asp Gly Gly Cys Ala Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp
1310 1315 1320

Glu Cys His Ala Val Asp Ser Thr Thr Ile Leu Gly Ile Gly Thr
1325 1330 1335

Val Leu Asp Gln Ala Glu Thr Ala Gly Val Arg Leu Thr Val Leu
1340 1345 1350

Ala Thr Ala Thr Pro Pro Gly Ser Val Thr Thr Pro His Pro Asn
1355 1360 1365

Ile Glu Glu Val Ala Leu Gly Gln Glu Gly Glu Ile Pro Phe Tyr
1370 1375 1380

Gly Arg Ala Ile Pro Leu Ser Tyr Ile Lys Gly Gly Arg His Leu
1385 1390 1395

Ile Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Ala
1400 1405 1410

Leu Arg Gly Met Gly Leu Asn Ala Val Ala Tyr Tyr Arg Gly Leu
1415 1420 1425

Asp Val Ser Val Ile Pro Thr Gln Gly Asp Val Val Val Val Ala
1430 1435 1440

Thr Asp Ala Leu Met Thr Gly Phe Thr Gly Asp Phe Asp Ser Val
1445 1450 1455

Ile Asp Cys Asn Val Ala Val Thr Gln Val Val Asp Phe Ser Leu
1460 1465 1470

Asp Pro Thr Phe Thr Ile Thr Thr Gln Thr Val Pro Gln Asp Ala
1475 1480 1485

Val Ser Arg Ser Gln Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu
1490 1495 1500

Gly Ile Tyr Arg Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met
1505 1510 1515

Phe Asp Ser Val Val Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala
1520 1525 1530

Trp Tyr Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala
1535 1540 1545

Tyr Phe Asn Thr Pro Gly Leu Pro Val Cys Gln Asp His Leu Glu
1550 1555 1560

Phe Trp Glu Ala Val Phe Thr Gly Leu Thr His Ile Asp Ala His
1565 1570 1575

Phe Leu Ser Gln Thr Lys Gln Ser Gly Glu Asn Phe Ala Tyr Leu
1580 1585 1590

Thr Ala Tyr Gln Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro
1595 1600 1605

Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr Arg Leu Lys Pro
1610 1615 1620

Trp Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu Gly Ser Val
1625 1630 1635

Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr Ile Ala
1640 1645 1650

Thr Cys Met Gln Ala Asp Leu Glu Val Met Thr Ser Thr Trp Val
1655 1660 1665

Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala
1670 1675 1680

Thr Gly Cys Val Cys Ile Ile Gly Arg Leu His Val Asn Gln Arg
1685 1690 1695

Ala Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp
1700 1705 1710

Glu Met Glu Glu Cys Ala Ser Arg Ala Ala Leu Ile Glu Glu Gly
1715 1720 1725

Gln Arg Ile Ala Glu Met Leu Lys Ser Lys Ile Gln Gly Leu Leu
1730 1735 1740

Gln Gln Ala Ser Lys Gln Ala Gln Asp Ile Gln Pro Ala Val Gln
1745 1750 1755

Ala Ser Trp Pro Lys Val Glu Gln Phe Trp Ala Lys His Met Trp
1760 1765 1770

Asn Phe Ile Ser Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu
1775 1780 1785

Pro Gly Asn Pro Ala Val Ala Ser Met Met Ala Phe Ser Ala Ala
1790 1795 1800

Leu Thr Ser Pro Leu Ser Thr Ser Thr Thr Ile Leu Leu Asn Ile
1805 1810 1815

Leu Gly Gly Trp Leu Ala Ser Gln Ile Ala Pro Pro Ala Gly Ala
1820 1825 1830

Thr Gly Phe Val Val Ser Gly Leu Val Gly Ala Ala Val Gly Ser
1835 1840 1845

Ile Gly Leu Gly Lys Val Leu Val Asp Ile Leu Ala Gly Tyr Gly
1850 1855 1860

Ala Gly Ile Ser Gly Ala Leu Val Ala Phe Lys Ile Met Ser Gly
1865 1870 1875

Glu Lys Pro Ser Met Glu Asp Val Val Asn Leu Leu Pro Gly Ile
1880 1885 1890

Leu Ser Pro Gly Ala Leu Val Val Gly Val Ile Cys Ala Ala Ile
1895 1900 1905

Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gln Trp Met
1910 1915 1920

Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro
1925 1930 1935

Thr His Tyr Val Thr Glu Ser Asp Ala Ser Gln Arg Val Thr Gln
1940 1945 1950

Leu Leu Gly Ser Leu Thr Ile Thr Ser Leu Leu Arg Arg Leu His
1955 1960 1965

Asn Trp Ile Thr Glu Asp Cys Pro Ile Pro Cys Ser Gly Ser Trp
1970 1975 1980

Leu Arg Asp Val Trp Asp Trp Val Cys Thr Ile Leu Thr Asp Phe
1985 1990 1995

Lys Asn Trp Leu Thr Ser Lys Leu Phe Pro Lys Met Pro Gly Leu
2000 2005 2010

Pro Phe Ile Ser Cys Gln Lys Gly Tyr Lys Gly Val Trp Ala Gly
2015 2020 2025

Thr Gly Ile Met Thr Thr Arg Cys Pro Cys Gly Ala Asn Ile Ser
2030 2035 2040

Gly Asn Val Arg Leu Gly Ser Met Arg Ile Thr Gly Pro Lys Thr
2045 2050 2055

Cys Met Asn Ile Trp Gln Gly Thr Phe Pro Ile Asn Cys Tyr Thr
2060 2065 2070

Glu Gly Gln Cys Val Pro Lys Pro Ala Pro Asn Phe Lys Ile Ala
2075 2080 2085

Ile Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu Val Thr Gln His
2090 2095 2100

Gly Ser Tyr His Tyr Ile Thr Gly Leu Thr Thr Asp Asn Leu Lys
2105 2110 2115

Val Pro Cys Gln Leu Pro Ser Pro Glu Phe Phe Ser Trp Val Asp
2120 2125 2130

Gly Val Gln Ile His Arg Phe Ala Pro Ile Pro Lys Pro Phe Phe
2135 2140 2145

Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Phe Val Val
2150 2155 2160

Gly Ser Gln Leu Pro Cys Asp Pro Glu Pro Asp Thr Asp Val Leu
2165 2170 2175

Thr Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu Thr Ala
2180 2185 2190

Ala Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser
2195 2200 2205

Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr
2210 2215 2220

Thr His Gly Lys Ala Tyr Asp Val Asp Met Val Asp Ala Asn Leu
2225 2230 2235

Phe Met Gly Gly Asp Val Thr Arg Ile Glu Ser Glu Ser Lys Val
2240 2245 2250

Val Val Leu Asp Ser Leu Asp Pro Met Val Glu Glu Arg Ser Asp
2255 2260 2265

Leu Glu Pro Ser Ile Pro Ser Glu Tyr Met Leu Pro Lys Lys Arg
2270 2275 2280

Phe Pro Pro Ala Leu Pro Ala Trp Ala Arg Pro Asp Tyr Asn Pro
2285 2290 2295

Pro Leu Val Glu Ser Trp Lys Arg Pro Asp Tyr Gln Pro Ala Thr
2300 2305 2310

Val Ala Gly Cys Ala Leu Pro Pro Pro Lys Lys Thr Pro Thr Pro
2315 2320 2325

Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser Glu Ser Ser Ile
2330 2335 2340

Ala Asp Ala Leu Gln Gln Leu Ala Ile Lys Ser Phe Gly Gln Pro
2345 2350 2355

Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Asp Ala Ala
2360 2365 2370

Asp Ser Gly Ser Arg Thr Pro Pro Asp Glu Leu Ala Leu Ser Glu
2375 2380 2385

Thr Gly Ser Ile Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly
2390 2395 2400

Asp Pro Asp Leu Glu Pro Glu Gln Val Glu Leu Gln Pro Pro Pro
2405 2410 2415

Gln Gly Gly Val Val Thr Pro Gly Ser Gly Ser Gly Ser Trp Ser
2420 2425 2430

Thr Cys Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser
2435 2440 2445

Tyr Ser Trp Thr Gly Ala Leu Ile Thr Pro Cys Ser Pro Glu Glu
2450 2455 2460

Glu Lys Leu Pro Ile Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr
2465 2470 2475

His Asn Lys Val Tyr Cys Thr Thr Ser Lys Ser Ala Ser Leu Arg
2480 2485 2490

Ala Lys Lys Val Thr Phe Asp Arg Met Gln Ala Leu Asp Ala His
2495 2500 2505

Tyr Asp Ser Val Leu Lys Asp Ile Lys Leu Ala Ala Ser Lys Val
2510 2515 2520

Thr Ala Arg Leu Leu Thr Leu Glu Glu Ala Cys Gln Leu Thr Pro
2525 2530 2535

Pro His Ser Ala Arg Ser Lys Tyr Gly Phe Gly Ala Lys Glu Val
2540 2545 2550

Arg Ser Leu Ser Gly Arg Ala Val Asn His Ile Lys Ser Val Trp
2555 2560 2565

Lys Asp Leu Leu Glu Asp Thr Gln Thr Pro Ile Pro Thr Thr Ile
2570 2575 2580

Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Thr Lys Gly Gly
2585 2590 2595

Lys Lys Ala Ala Arg Leu Ile Val Tyr Pro Asp Leu Gly Val Arg
2600 2605 2610

Val Cys Glu Lys Met Ala Leu Tyr Asp Ile Thr Gln Lys Leu Pro
2615 2620 2625

Gln Ala Val Met Gly Ala Ser Tyr Gly Phe Gln Tyr Ser Pro Ala
2630 2635 2640

Gln Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp
2645 2650 2655

Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val
2660 2665 2670

Thr Glu Arg Asp Ile Arg Thr Glu Glu Ser Ile Tyr Arg Ala Cys
2675 2680 2685

Ser Leu Pro Glu Glu Ala His Thr Ala Ile His Ser Leu Thr Glu
2690 2695 2700

Arg Leu Tyr Val Gly Gly Pro Met Phe Asn Ser Lys Gly Gln Thr
2705 2710 2715

Cys Gly Tyr Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser
2720 2725 2730

Met Gly Asn Thr Ile Thr Cys Tyr Val Lys Ala Leu Ala Ala Cys
2735 2740 2745

Lys Ala Ala Gly Ile Ile Ala Pro Thr Met Leu Val Cys Gly Asp
2750 2755 2760

Asp Leu Val Val Ile Ser Glu Ser Gln Gly Thr Glu Glu Asp Glu
2765 2770 2775

Arg Asn Leu Arg Ala Phe Thr Glu Ala Met Thr Arg Tyr Ser Ala
2780 2785 2790

Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr Asp Leu Glu Leu Ile
2795 2800 2805

Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu Gly Pro Gln Gly
2810 2815 2820

Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr Pro Ile Ala
2825 2830 2835

Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn Ser Trp
2840 2845 2850

Leu Gly Asn Ile Ile Gln Tyr Ala Pro Thr Ile Trp Ala Arg Met
2855 2860 2865

Val Leu Met Thr His Phe Phe Ser Ile Leu Met Ala Gln Asp Thr
2870 2875 2880

Leu Asp Gln Asn Leu Asn Phe Glu Met Tyr Gly Ala Val Tyr Ser
2885 2890 2895

Val Ser Pro Leu Asp Leu Pro Ala Ile Ile Glu Arg Leu His Gly
2900 2905 2910

Leu Asp Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr
2915 2920 2925

Arg Val Ala Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg
2930 2935 2940

Ala Trp Lys Ser Arg Ala Arg Ala Val Arg Ala Ser Leu Ile Ser
2945 2950 2955

Arg Gly Gly Arg Ala Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp
2960 2965 2970

Ala Val Lys Thr Lys Leu Lys Leu Thr Pro Leu Pro Glu Ala Arg
2975 2980 2985

Leu Leu Asp Leu Ser Ser Trp Phe Thr Val Gly Ala Gly Gly Gly
2990 2995 3000

Asp Ile Tyr His Ser Val Ser Arg Ala Arg Pro Arg Leu Leu Leu
3005 3010 3015

Leu Gly Leu Leu Leu Leu Phe Val Gly Val Gly Leu Phe Leu Leu
3020

Pro Ala Arg
3033
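
The nucleotide listings and the amino acid listing can be cross-checked against each other by translating a stretch of cDNA with the standard genetic code. The following is a minimal sketch of such a check and is not part of the original listing. The function name `translate`, the reading-frame offset of 2, and the reading of the garbled block `AA6GATGTAT` in the first row of SEQ ID NO: 3 as `AAGGATGTAT` are assumptions made for this illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, offset=0):
    """Translate a DNA string into one-letter amino acid codes.

    `offset` is the number of leading bases to skip so that translation
    starts on a codon boundary. Spaces are ignored; any codon containing
    a character other than A, C, G, T is reported as 'X'.
    """
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First row (positions 1-60) of SEQ ID NO: 3; the sixth base of the fifth
# block is assumed to be G where the scan shows the digit 6.
ROW_1 = "GGCATTACCC CTGCACAGTT AACTATACCA TCTTCAAAAT AAGGATGTAT GTGGGGGGGG"

if __name__ == "__main__":
    print(translate(ROW_1, offset=2))
```

Under these assumptions the translated stretch reads His Tyr Pro Cys Thr Val Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly, which agrees with residues 621 to 639 of SEQ ID NO: 5.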